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LINKED PEPTIDE NUCLEIC ACIDS 

CROSS REFERENCE TO RELATED APPLICATIONS 

This patent application is a continuation-in-part of 
United States patent application Serial No. 08/275,951, filed 
5 July 15, 1994, which is a continuation-in-part of United States 
patent application Serial No. 08/108,591, filed Aug. 27, 1993, 
which is the U.S. national phase of international patent 
application PCT/EP/01219, filed May 22, 1992, which claims the 
priority benefit of the following Danish patent applications: 

10 No. 986/91, filed May 24, 1991, No. 987/91, filed May 24, 1991, 
and No. 510/92, filed April 15, 1992. Application Serial No. 
08/275,951 also is a continuation-in-part of United States 
patent application Serial No. 08/088,658, filed July 2, 1993, 
and United States patent application Serial No. 08/088,661, 

15 filed July 2, 1993. The entire disclosure of each of the 
foregoing patent applications is incorporated herein by 
reference . 



. FIELD OF THE INVENTION 

This invention is directed to compounds that are not 
20 polynucleotides yet which bind to complementary DNA and RNA 
scrands more strongly than corresponding polynucleotides. In 
particular, the invention concerns novel peptide nucleic acid 
compounds and novel linked peptide nucleic acid compounds 
wherein naturally-occurring nucleobases or other nucleobase- 
25 binding moieties are covalently bound to a polyamide backbone 
which is covalently linked via a linking moiety to a second 
similarly substituted polyamide backbone. 
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BACKGROUND OF THE INVENTION 

Oligonucleotides and their analogs have been 
developed and used in molecular biology in certain procedures 
as probes, primers, linkers, adapters, and gene fragments. 
5 Modifications to oligonucleotides used in these procedures 
include labeling with non isotopic labels, e.g. fluorescein, 
biotin, digoxigenin, alkaline phosphatase, or other reporter 
molecule. Other modifications have been made to the ribose 
phosphate backbone to increase the nuclease stability of the 

10 resulting analog. These modifications include use of methyl 
phosphonates , phosphorothioates , phosphorodithioate linkages, 
and 2'-0-methyl ribose sugar units. Further modifications, 
include modification made to modulate uptake and cellular 
distribution. Phosphorothioate oligonucleotides are presently 

15 being used as antisense agents in human clinical trials for 
various disease states including use as antiviral agents . With 
the success of these oligonucleotides for both diagnostic and 
therapeutic uses, there exists an ongoing demand for improved 
oligonucleotide analogs. 

20 Oligonucleotides can interact with native DNA and RNA 

ir. several ways. One of these is duplex formation between an 
oligonucleotide and a single stranded nucleic acid. The other 
is triplex formation between an oligonucleotide and double 
stranded DNA to form a triplex structure; however, to form a 

2 5 triplex structure with a double stranded DNA, the cytosine 
bases of the oligonucleotide must be protonated. This thus 
renders such triplexing pH dependent. P.O. P. Ts'o and 
associates have used pseudo isocytosine as a permanently 
prctonated analogue of cytosine in DNA triplexing (see Ono, et 

30 a:., J. Am. Chem. Soc, 1991, 113, 4032-4033; Ono, et. al . , J . 
Org. Chem., 1992, 57, 3225-3230). Trapane and Ts'o have also 
suggested the us of pseudo isocytosine for triplex formation 
wi-h singe -stranded nucleic acid targets, (see, Trapane, et. 
al., J. Biomol. Strul. Struct., 1991, 8, 229; Trapane, et. al . , 

35 Eiophys. <J. , 1992, 61, 2437; and Trapane, et. al . , Abstracts 
Ccxference on Nucleic Acids Medical Applications, Cancun, 
Mexico, January 1993) . 8-Oxoadenine was also suggested in 
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patent application WO 93/05180 for protonated cytosine in 
triplex formation. 

Peptide nucleic acids are compounds that in certain 
respects are similar to oligonucleotide analogs however in 
5 other very important respects their structure is very 
different. In peptide nucleic acids, the deoxyribose phosphate 
backbone of oligonucleotides has been replaced with a backbone 
more akin to a peptide than a sugar phosphodiester . Each 
subunit has a naturally occurring or non naturally occurring 

10 base attached to this backbone. One such backbone is 
constructed of repeating units of N- (2 -aminoethyl) glycine 
linked through amide bonds. Because of the radical deviation 
from the deoxyribose backbone, these compounds were named 
peptide nucleic acids (PNAs) . 

15 PNA binds both DNA and RNA to form PNA/DNA or PNA/RNA 

duplexes. The resulting PNA/DNA or PNA/RNA duplexes are bound 
with greater affinity than corresponding DNA/DNA or DNA/RNA 
duplexes as determined by Tm's. This high thermal stability 
might be attributed to the lack of charge repulsion due to the 

20 neutral backbone in PNA. The neutral backbone of the PNA also 
results in the Tm's of PNA/DNA (RNA) duplex being practically 
independent of the salt concentration. Thus the PNA/DNA duplex 
interaction offers a further advantage over DNA/DNA duplex 
interactions which are highly dependent on ionic strength. 

25 Homopyrimidine PNAs have been shown to bind complementary DNA 
or RNA forming (PNA) 2 /DNA (RNA) triplexes of high thermal 
stability (see, e.g., Egholm, et al., Science, 1991, 254, 1497; 
' Eaholm, et al. 9 J. Am. Chem. Soc, 1992, 114, 1895; Egholm, et 
al. t J. Am. Chem. Soc, 1992, 214, 9677). 

30 In addition to increased affinity, PNA has also been 

shown to bind to DNA with increased specificity. When a 
PNA/DNA duplex mismatch is melted relative to the DNA/DNA 
duplex there is seen an 8 to 20 °C drop in the Tm. This 
magnitude of a drop in Tm is not seen with the corresponding 

35 DKA/DNA duplex with a mismatch present. 

The binding of a PNA strand to a DNA or RNA strand 
can occur in one of two orientations. The orientation is said 
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to be ant i -parallel when the DNA or RNA strand in a 5' to 3' 
orientation binds to the complementary PNA strand such that the 
carboxyl end of the PNA is directed towards the 5' end of the 
DNA or RNA and amino end of the PNA is directed towards the 3' 
5 end of the DNA or RNA. In the parallel orientation the 
carboxyl end and amino end of the PNA are just the reverse with 
respect to the 5' -3' direction of the DNA or RNA. 

PNAs bind to both single stranded DNA and double 
stranded DNA. As noted above, in binding to double stranded 

10 DNA it has been observed that two strands of PNA can bind to 
the DNA. While PNA/DNA duplexes are stable in the antiparallel 
configuration, it was previously believed that the parallel 
orientation is preferred for (PNA) 2 /DNA triplexes. 

The binding of two single stranded pyrimidine PNAs 

15 to a double stranded DNA has been shown to take place via 
strand displacement, rather than conventional triple helix 
formation as observed with triplexing oligonucleotides. When 
PNAs strand invade double stranded DNA, one strand of the DNA 
is displaced and forms a loop on the side of the PNA 2 /DNA 

20 complex area. The other strand of the DNA is locked up in the 
(PNA) 2 /DNA triplex structure. The loop area (alternately 
referenced as a P loop) being single stranded, is susceptible 
to cleavage by enzymes that can cleave single stranded DNA. 

A further advantage of PNA compared to 

25 oligonucleotides is that their polyamide backbone (having 
appropriate nucleobases or other side chain groups attached 
thereto) is not recognized by either nucleases or proteases and 
are not cleaved. As a result PNAs are resistant to degradation 
by enzymes unlike DNA and peptides. 

30 Because of their properties, PNAs are known to be 

useful in a number of different areas. Since PNAs having 
stronger binding and greater specificity than oligonucleotides, 
they are used as probes in cloning, blotting procedures, and 
in applications such as fluorescence in situ hybridization 

35 (FISH) . Homopyrimidine PNAs are used for strand displacement 
in homopurine targets. The restriction sites that overlap with 
or are adjacent to the P-loop will not be cleaved by 
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restriction enzymes. Also, the local triplex inhibits gene 
transcription. Thus in binding of PNAs to specific restriction 
sites within a DNA fragment, cleavage at those sites can be 
inhibited. Advantage can be taken of this in cloning and 
5 subcloning procedures. Labeled PNAs are also used to directly 
map DNA molecules. In effecting this, PNA molecules having a 
fluorescent label are hybridized to complementary sequences in 
duplex DNA using strand invasion. 

PNAs have further been used to detect point mutations 

10 in PCR -based assays (PGR clamping) . PCR clamping uses PNA to 
detect point mutations in a PCR- based assay, e.g. the 
distinction between a common wild type allele and a mutant 
allele, in a segment of DNA under investigation. A PNA 
oligomer complementary to the wild type sequence is 

15 synthesized. The PCR reaction mixture contains this PNA and 
two DNA primers, one of which is complementary to the mutant 
sequence. The wild type PNA oligomer and the DNA primer 
compete for hybridization to the target. Hybridization of the 
DNA primer and subsequent amplification will only occur if the 

20 target is a mutant allele. With this method, one can determine 
the presence and exact identity of a mutant. 



OBJECTS OF THE INVENTION 

It is an object of this invention to provide 
compounds that bind ssDNA, dsDNA and ssRNA nucleic acids to 
25 form complexes with improved thermal stability, specificity, 
and other properties relative to corresponding DNA. 

It is a further object of this invention to provide 
compounds that bind nucleic acids via strand invasion using two 
sequences of PNA which may be linked together to form a bis PNA 
30 wherein one strand binds anti-parallel relative to the target 
utilizing Watson/Crick type hydrogen bonds and the second 
strand binds parallel relative to the target utilizing 
Hoogsteen type hydrogen bonds. 

It is a further object of this invention to provide 
35 PNAs and bis PNAs wherein C-pyrimidine heterocyclic bases or 
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iso pyrimidine heterocyclic bases are substituted in place of 
at least one pyrimidine heterocyclic base. 

It is a further object of this invention to provide 
compounds that bind nucleic acids via strand invasion using two 
5 sequences of PNA which may be linked together wherein the 
cytosines of the parallel strand relative to the target have 
been replaced with pseudo isocytosines to form a bis PNA 
wherein one strand binds ant i -parallel relative to the target 
forming Watson/Crick type hydrogen bonds and the second strand 

10 binds parallel relative to the target forming Hoogsteen type 
hydrogen bonds. 

It is a further object of this invention to provide 
bis PNA structures wherein the cytosine nucleobases are 
replaced with pseudo isocytosines in the Hoogsteen strand. 

15 It is a further object of this invention to provide 

therapeutic, diagnostic, and prophylactic methods that employ 
such compounds . 

SUMMARY OF THE INVENTION 

The present invention is directed to modified peptide 

20 nucleic acids especially PNAs that are linked via a linking 
segment. Such PNAs have been given the short hand name "bis 
peptide nucleic acids" or "bis PNAs." The present invention 
is also directed to modified peptide nucleic acids that 
incorporate certain non-natural nucleobases for Hoogsteen type 

25 base paring. These modified peptide nucleic acids are 
particularly useful for diagnostic uses, including the 
' identification of certain sites in double stranded DNA, 
restriction enzyme sites, transcription inhibition, clamping 
to detect point mutations and for use in Hoogsteen strands in 

30 triplexing motif. 

In accordance with this invention there are provided 
compounds that include a peptide nucleic acid that has at least 
one peptide nucleic acid monomeric unit having a pyrimidine 
heterocyclic base that is a C-pyrimidine heterocyclic base or 

35 an iso-pyrimidine heterocyclic base. In certain preferred 
embodiments of this invention the pyrimidine heterocyclic base 
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is a C-pyrimidine heterocyclic base. In other preferred 
embodiments of this invention the pyrimidine heterocyclic base 
is pseudo-isocytosine . In a further embodiment of the 
invention the C-pyrimidine heterocyclic base is pseudo-uracil, 
5-bromouracil, iso-cytosine or other iso-pyrimidine 
heterocyclic base. 

Compounds of the invention, including compounds 
having C-pyrimidines and iso-pyrimidine heterocyclic bases, 
include compounds of formula I : 

L 1 L 2 I" 



> v ^v*V 2 



10 wherein: 

n is at least 2, 

each of L x -L n is independently selected from the 
group consisting of hydrogen, hydroxy, (Ci-C 4 ) alkanoyl, 
naturally occurring nucleobases, non-naturally occurring 

15 nucleobases, aromatic moieties, DNA intercalators, nucleobase- 
binding groups, heterocyclic moieties, and reporter ligands; 

each of tf-C" is (CR 6 R 7 ) y where R fi is hydrogen and R 7 
is selected from the group consisting of the side chains of 
nacurally occurring alpha amino acids, or R 6 and R 7 are 

20 independently selected from the group consisting of hydrogen, 
<C : -C € )alkyl, aryl, aralkyl, heteroaryl, hydroxy, (C x -C 6 ) alkoxy, 
(C--C € ) alkylthio, NR 3 R 4 and SR 5 , where R 3 and R 4 are each 
independently selected from the group consisting of hydrogen, 
(C-CJalkyl, hydroxy- or alkoxy- or alkylthio-substituted (C x - 

25 C 4 ;alkyl, hydroxy, alkoxy, alkylthio and amino, and R 5 is 
hydrogen, (C^Cg) alkyl, hydroxy-, alkoxy-, or alkylthio- 
substituted (Ci-C^alkyl, or R 6 and R 7 taken together complete 
an alicyclic or heterocyclic system; 

each of D X -D B is (CR 6 R 7 ) 2 where R 6 and R 7 are as 

30 defined above; 
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each of y and z is zero or an integer from 1 to 10, 
the sum y + z being greater than 1 but not more than 10; 

each of G 1 -G n_1 is -NR 3 CO-, -NR 3 CS-, -NR 3 S0- or 
-NR 3 S0 2 -, in either orientation, where R 3 is as defined above; 

each of A 1 -A n and B x -B n are selected such that: 

(a) A is a group of formula (11a), (lib), (lie) or 
(lid) , and B is N or R 3 N* ; or 

(b) A is a group of formula (lid) and B is CH; 



10 



(Ha) 



(lib) 




where : 

X is O, S, Se, NR 3 , CH 2 or C(CH 3 ) 2 ; 

Y is a single bond, O, S or NR 4 ; 
each of p and q is zero or an integer from 1 to 5, the sum p+q 
15 being not more than 10; 

each of r and s is zero or an integer from 1 to 5, 
the sum r+s being not more than 10; 

each R 1 and R 2 is independently selected from the 
group consisting of hydrogen, (Ci-Cjalkyl which may be 
20 hydroxy- or alkoxy- or alkylthio- substituted, hydroxy, alkoxy, 
alkylthio, amino and halogen; and 

each R 3 and R 4 are as defined above; 

Q is -C0 2 H, -CONR'R'', -S0 3 H or -S0 2 NR'R' ' or an 
activated derivative of -C0 2 H or -S0 3 H; and 
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I is -NHR"'R"" or -NR' ' ' C(0)R' ' ' ' , where R' , R w , 
R' ' ' and R' ' ' ' are independently selected from the group 
consisting of hydrogen, alkyl, amino protecting groups, 
reporter ligands, intercalators, chelators, peptides, proteins, 
carbohydrates, lipids, steroids, nucleosides, nucleotides, 
nucleotide diphosphates, nucleotide triphosphates, oligonucleo- 
tides, oligonucleosides and soluble and non-soluble polymers. 

Peptide nucleic acids compounds of the invention 
further include compounds of structure III, IV or V: 



0- CCH 2 ) f 




0 ; 



R h Ji :CH ^C^ ; } N \J :CH 2. 




0 



R 



7 ' 



H 



ii i I 



n 



" NH- H' 



III 



(CH,) 




2 or 



KM, 




R 



NR 



> i 



N' 
H 



(CH 2 ) 
NR 3 



V ^ N H - R 



t n 



IV 



R 



10 wherein: 

each L is independently selected from the group 
consisting of hydrogen, phenyl, heterocyclic moieties, 
naturally occurring nucleobases, and non-naturally occurring 
nucleobases; 

15 each R 7 ' is independently selected from the group 

consisting of hydrogen and the side chains of naturally 
occurring alpha amino acids; 
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V 



n is an integer greater than 1 # 

each k, 1, and m is, independently, zero or an 
integer from 1 to 5; 

each p is zero or 1; 
5 R h is OH, NH 2 or -NHLysNH 2 ; and 

R 1 is H or C0CH 3 . 

Further in accordance with this invention there are 
provided compounds having a first and a second peptide nucleic 
acid segments that are joined together via at least one linking 
10 segment that is not a peptide nucleic acid or an 
oligonucleotide . 

In preferred embodiments of the invention, the 
linking segment includes a linear structure having a carboxylic 
acid functional group on one end thereof and a primary amino 
15 functional group on the other end thereof. Preferred linking 
segments includes at least one unit of the structure: 

- [HN-Z-COO] n - 

wherein n is 1 to 3; and Z is C l -C 2 c alkyl, C 2 -C 20 alkenyl, C 2 -C 20 
alkynyl, C x -C 20 alkanoyl having at least one O or S hetero atom, 
20 C : -C 17 aryl, or C 7 -C 34 aralkyl . 

In a more preferred embodiment, the linking segment 
includes at least one aminoalkylcarboxylic acid of the formula: 

-NH- (CH 2 ) c -C<=0) - 

where e is 1 to 15. 
25 In certain preferred embodiments e is from 4 to 8 . 

In a more preferred embodiment e is 5 or 6 . 
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In other preferred embodiments, the linking segment 
includes structures of the immediate above formula and at least 
one further a-amino acid such that they are of formula: 

- (AA) h - [NH~ <CH 2 ) e -C{=0) - <AA) f ] g - 

5 where : 

AA is an a-amino acid; 
e is 4 to 8; 
f and h are 0 or 1; and 
g is 1 to 4. 

10 In further preferred embodiments, the linking segment 

includes at least one unit of a glycol amino acid. The glycol 
amino acid is formed of glycol sub-units linked together in a 
linear array and having an amino group on one terminus and a 
carboxyl group on the other terminus. Preferred glycol amino 

15 acid linking segments are compounds of the formula: 

- [NH- (CH 2 -CH 2 -0-) r CH 2 -C(=0) -] A 
wherein j is 1 to 6; and i is 1 to 6, In one particularly 
preferred embodiment, j is 2 and i is 3. 

In a further embodiment of the invention, both of the 
20 ends of two respective peptide nucleic acid segments are joined 
together via two of the linking segments to form a cyclic 
structure. 

In a further embodiment of the invention, the linking 
segment connects a terminal amine function on one of first and 

25 second peptide nucleic acid segments to a carboxyl function on 
the other of first and second peptide nucleic acid segments. 

In certain preferred embodiments of the invention, 
the nucleobase sequence of the first peptide nucleic acid 
segment, in a direction from its amine terminus to its carboxyl 

30 terminus, is the same as the nucleobase sequence of the second 
peptide nucleic acid segment, in a direction from its carboxyl 
terminus to its amine terminus. 

In other embodiments of the invention, at least a 
portion of the nucleobases of the first and second peptide 

35 nucleic acid segments are pyrimidine nucleobases. In a further 
embodiment of the invention, at least one of the pyrimidine 
nucleobases of one of the first or the second peptide nucleic 
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acid segments comprises a C-pyrimidine heterocyclic base or an 
iso-pyrimidine heterocyclic base. In a further embodiment of 
the invention, a portion of the nucleobases that are pyrimidine 
nucleobases are located in contiguous homopyrimidine sequences. 
5 Compounds of the invention also include multiple 

stranded structures having a nucleic acid strand, at least a 
portion of which forms a target nucleotide sequence, and a 
further strand, formed from first and second peptide nucleic 
acid segments that, in turn, are joined together via a linker. 

10 The sequence of the nucleobases of the first peptide nucleic 
acid segment is selected to be complementary to the target 
nucleotide sequence in the 5' to 3' direction of the target 
nucleotide sequence and the sequence of the nucleobases of the 
second peptide nucleic acid segment is selected to be 

15 complementary to the target nucleotide sequence in the 3' to 
5' direction of the target nucleotide sequence. 

In certain embodiments of the invention the nucleic 
acid strand is a single stranded DNA or RNA and in further 
embodiments of the invention the nucleic acid strand is a 

2 0 double stranded DNA. 

In still a further embodiment of the invention one 
of the first or second peptide nucleic acid segments binds to 
the target nucleotide sequence utilizing Watson/Crick type 
hydrogen bonding and the other of the first or second peptide 

25 nucleic acid segments binds to the target nucleotide sequence 
utilizing Hoogsteen type hydrogen bonding. In a preferred 
embodiment, the one of the first or second peptide nucleic acid 
segments that binds to the target nucleotide sequence utilizing 
said Hoogsteen hydrogen bonding includes C-pyrimidine 

30 heterocyclic nucleobases or iso-pyrimidine heterocyclic 
nucleobases in at least one of the positions that are 
complementary to nucleobases in the target nucleotide sequence. 
In certain preferred embodiments the C-pyrimidine heterocyclic 
nucleobase or iso-pyrimidine heterocyclic nucleobase are 

35 selected as pseudo-isocytosine, iso-cytosine, pseudo-uracil or 
5-bromouracil . 
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Compounds of the invention also include a compound 
having a first segment of joined peptide nucleic acid units 
having a first sequence of nucleobases and a second segment of 
joined peptide nucleic acid units having second sequence of 
5 nucleobases and a linker group linking the first and the second 
segments of peptide nucleic acid units. The first segment of 
peptide nucleic acid units extends from an amino end to a 
carboxyl end and the second segment of peptide nucleic acid 
units extends from an amino end to a carboxyl end with the 
10 linker group linking the carboxyl end of the first segment of 
peptide nucleic acid units to the amino end of the second 
segment of peptide nucleic acid units. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The numerous objects and advantages of the present 
15 invention may be better understood by those skilled in the art 
by reference to the accompanying figures, in which: 

Figure 1 shows a synthetic scheme according to the 
invention and discussed in Example 26. 

Figure 2 shows a synthetic scheme according to the 
20 invention and discussed in Example 33. 

Figure 3 shows a synthetic scheme according to the 
invention and discussed in Example 37. 

Figure 4 shows a synthetic scheme according to the 
invention and discussed in Example 41. 

25 DESCRIPTION OF PREFERRED EMBODIMENTS 

This invention is directed to novel PNA molecules and 
novel linked PNA molecules. The linked PNA molecules are 
formed from PNA strands that are joined together with a linking 
segment. These novel, linked molecules are herein referred to 
3 0 as "bis PNAs," Bis PNAs have been shown to have improved 
binding, specificity and recognition properties over single 

stranded PNAs. 

In accordance with this invention, it has been found 
that the most stable triplexes that are formed between two 
35 single stranded PNAs or a bis PNA and a DNA or RNA target 
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strand are triplexes wherein the Watson/Crick base pairing 
strand is in an anti -parallel orientation relative to the 
target strand and the Hoogsteen base pairing strand is in a 
parallel orientation relative to the target strand. As so 
5 orientated to the target strand, the two PNA strands are 
therefore anti-parallel to each other. 

In the PNA molecules and linked PNA molecules or bis 
PNAs of the invention as shown in the structures of Formula I 
above, ligand L is primarily a naturally occurring nucleobase 

10 attached at the position found in nature, i.e., position 9 for 
adenine or guanine, and position 1 for thymine or cytosine. 
Alternatively, L may be a non-naturally occurring nucleobase 
(nucleobase analog) , another base -binding moiety, an aromatic 
mciety, (C L -C 4 ) alkanoyl, hydroxy or even hydrogen. In certain 

15 preferred embodiments at least one L in the structure is a C- 
pyrimidine heterocyclic base or an iso-pyrimidine heterocyclic 
base. In other embodiments L can be a DNA intercalator, a 
reporter ligand such as, for example, a fluorophor, radio 
label, spin label, hapten, or a protein-recognizing ligand such 

20 as biotin. 

For purposes of this invention, the term "pyrimidine" 
refers to any 1,3-diazine, irrespective of its substituents or 
position of attachment the other molecular entities. 
Pyrimidines according to the invention include both naturally- 

25 occurring and synthetic nucleobases bases and their analogs. 
C-pyrimidine nucleobases are nucleobases that if located in a 
nucleoside would be connected to the sugar portion of the 
. nucleoside via a carbon atom of the pyrimidine ring. As used 
with peptide nucleic acids of the invention, in a like manner 

30 tc the above described nucleoside connections, the C-pyrimidine 
bases are connected to the peptide nucleic acid backbone via 
a carbon atom of the pyrimidine ring, Iso-pyrimidines 
according to the invention are 4 -keto-2-amino- , 4-thio-2-amino, 
2 - thio- 4 - keto , and 2 - keto- 4 - t hio- disubst ituted pyrimidines ♦ 

3 5 Pseudo-pyrimidines are those that are directly or indirectly 
bound to a PNA strand through the pyrimidine 5 -position. 
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During synthesis L may be blocked with protecting 
groups. Suitable protecting groups are acid, base or hydrogen- 
olytically or photochemically cleavable protecting groups such 
as, for example, t-butoxycarbonyl (Boc) , f luorenylmethyl- 
5 oxycarbonyl (Pmoc) , benzyloxycarbonyl (Z or CBZ) , benzoyl, 2- 
chlorobenzyloxycarbonyl , or 2-nitrobenzyl (2Nb) . 

A can be a wide variety of groups such as -CRVCO-, 
-CR^CS-, -CR l R 3 CSe-, -CR^CNHR 3 - , -CR^C^CH^ and 
-CR^OCfCHa)^-, where R 1 , R 2 and R 3 are as defined above. 
10 Preferably, A is methylenecarbonyl (-CH 2 CO-). Also, A can be 
a longer chain moiety such as propanoyl, butanoyl or pentanoyl, 
or corresponding derivative, wherein O is replaced by another 
value of X or the chain is substituted with R*R 2 or is 
heterogenous, containing Y. Further, A can be a (C 3 - 
15 CJalkylene chain, a (C 2 -C 6 ) alkylene chain substituted with R X R 2 
or can be heterogenous, containing Y. In certain cases, A can 
just be a single bond. 

In certain preferred embodiments of the invention 
B is a nitrogen atom, thereby presenting the possibility of an 
20 achiral backbone. B can also be R 3 N\ where R 3 is as defined 
above. B can also be a CH group. 

In certain preferred embodiments of the invention, 
C is (-CR 6 R 7 -) y , where R* and R 7 are as defined above. R 6 and R 7 
also can be a heteroaryl group such as, for example, pyrrolyl, 
25 furyl, thienyl, imidazolyl, pyridyl, pyrimidinyl , indolyl, or 
can be taken together to complete an alicyclic system such as, 
fcr example, 1,2-cyclobutanediyl, 1, 2-cyclopentanediyl or 1,2- 
' eye lohexanediy 1 . 

In certain preferred embodiments of the invention D 
30 is a CH 2 group. D may also be CR 6 R 7 where R 6 and R 1 are as 
defined above. 

In certain preferred embodiments of the invention G 
is selected from -NR 3 CO-, -NR 3 CS- , -NR 3 SO- or -NR 3 S0 2 -, in either 
orientation, where R 3 is as defined above. 
35 The amino acids and the amino acid analogs that form 

the backbone of the peptide nucleic acids of the invention may 
be identical or different. We have found that those based on 
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N- (2 -aminoethyl) glycine are especially well suited to the 
purpose of the invention however a wide range of amino acid 
analogs may be used in the context of the invention. 

The linking segments of the present invention are 
5 compounds that are capable of linking two PNA strands together. 
The preferred orientation is to link the C terminus of a first 
PNA molecule to the N terminus of a second PNA molecule. Two 
presently preferred linking segments for linking the PNAs 
segments are "egl groups" (ethylene glycol) and "Aha groups" 
10 (amino hexanoic acid) linked together by amino acid groups. 
A further presently preferred linking segment includes the 
above Aha groups interspaced with cy-amino acids particularly 
glycine or lysine, 

A wide range of other compounds are also useful tor 

15 the linking segment and thus are included within the scope of 
the present invention. Generally the linking segment is a 
compound having a primary amino group and a carboxy group 
separated with a space spanning group wherein the space 
spanning group is made up of one or more functional groups. 

20 Some representative space spanning groups are C x to C 20 alkyl, 
C 2 to C 20 alkenyl, C 2 to C 20 alkynyl, c x to C 20 alkanoyl having at 
least one 0 or S atom, C 7 to C 34 aralkyl, C 6 -C 14 aryl and amino 
acids. Preferred alkanoyl groups can have from 1 to 10 hetero 
atoms (O or S) . Preferred alkanoyl groups include methyl, 

25 ethyl and propyl alkanoxy particularly polyethoxy, i.e., 
ethylene glycol. Amino acids including D, L, and DL isomers 
of a- amino acids as well as longer chained amino acids may also 
be linked together to form a linking segment. A particularly 
preferred amino acid is hexanoic amino acid. Aralkyl groups 

3 0 used as space spanning groups may have the amino or the carboxy 
group located on the aromatic ring or spaced with one or more 
CH- groups wherein the total number of CH 2 groups is less than 
or equal to twenty. The position of substitution in an aralkyl 
linked PNA may be varied; however, ortho and meta are presently 

35 preferred because substitution at these positions, especially 
ortho, induce the bis PNA to be bent, thus, facilitating 
location of the two joined peptide nucleic acid strands in 



PCT/US95/09084 

WO 96/02558 

- 17 - 

• parallel to one another. Another group of 
spacial locations parallel incorporate 
bi s PNAs that include induced bends are th 

cis -alkenyl ^ ^^^ compatibility with 

IB selecting a ^ ^ on one end 

5 PNA chemistry and ability to second pNA ig 

of a PNA to a ^^^^ can be selected so 
a ^ PNAS are able to 

as to be f leXlbl ^ UC s h ' or dsDNA in much the same way that 
interact with ssDNA, ssRNA ° r Some pre ferred 

10 two independent PNA single *^ \ q fae effective are 23 
linking segments that have been shown 

and 24 atoms in length. imDr0 ved binding affinity, 

n is PNAs have shown improveu 

««p e ificity over single stranded PNAs 
thermal stability, ^ "^""J^ shown that the preferred 
15 Us ing dsDNA as a target _xt ha ^ ^ ^ ^ pNA 

orientation is with the fir ^ q£ the 

parallel to the target i.: ^ ^ first pNA 

duplex is referenced in a 5 ^ ^ ^ second 

is complementary m an N co the targe t, i.e. it 
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to 3' direction in a ^ Qrientati to 

segment connects the PNA nce point, one strand is 
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each other, i.e. trom lined up in a 

• ^ m t"o C direction and the otnet 
25 lined up m a N to c an 

C to N direction. baund by theory it is 

A lthough we *° ™l^l and Qf tne bis PNA binds the 
believed that the antxparallel ^ ^ strand 

DNA target ^«^T*T^Cri* nature. The 
30 invasion. This bindi ng » o ^ ^ 

second PNA strand of the hydroge n bonding. It has 

binds the DNA using H-g^J ^ / stranded pNAs and 
been shown using the component sing^ ^ ^ ^ ^ ^ 
comparing them separately ana 

it binds faster to tne 
35 the bis PNA has a faster on • • ^ ^ ^ enforced 

target. This faster on rate . ^ ^ 

■ _.8»-,r of the second straw 
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We have also studied the effect of pH on the Tm of 
bis PNA bound to dsDNA as compared to the same bis PNA with the 
cytosines replaced with pseudo isocytosines . It has been 
observed in previous studies that there is a pronounced 
5 dependence on pH for binding of PNA to dsDNA. The decrease in 
Tm with higher pH shows that Hoogsteen binding in a (PNA) 2 /DNA 
complex is pH dependent. Normal Hoogsteen binding requires 
that the cytosines be protonated. This makes the Hoogsteen 
strand binding pH dependent. We have found that replacement 

10 of one or more of the cytosine nucleobases in a Hoogsteen 
strand with pseudo isocytosine and other like nucleobases 
removes this dependence. To demonstrate this effect, in two 
bis PNAs of the invention, one was synthesized such that the 
cytosines nucleobases in the parallel strand were replaced with 

15 pseudo isocytosines and the other was synthesized such that the 
cytosines in the antiparallel strand were replaced with pseudo 
isocytosines. The bis PNA with the pseudo isocytosines in the 
parallel strand showed almost no dependence on pH indicating 
that the parallel strand is involved with Hoogsteen binding. 

20 The replacement of cytosine by pseudo isocytosine or 

other like C-pyrimidine nucleobases is effected in a straight 
forward manner as per certain of the examples set forth below. 
This is in direct contrast with replacement of cytosine with 
pseudo isocytosine or other C-pyrimidines in nucleosides. In 

2 5 nucleosides, an anomeric specific carbon-carbon bond must be 

formed in synthesizing the C-nucleoside . Since there are no 
anomeric (sugar) carbon atoms in peptide nucleic acids, such 
constraints need not be considered. 

In a further aspect of the invention, the PNA and bis 
30 PNAs are conjugated to low molecular weight effector ligands 
such as ligands having nuclease activity or alkylating activity 
or reporter ligands (fluorescent, spin labels, radioactive, 
protein recognition ligands, for example, biotin or haptens) * 
In a further aspect of the invention, the PNAs and bis PNAs are 

3 5 conjugated to peptides or proteins, where the peptides have 

signaling activity and the proteins are, for example, enzymes, 
transcription factors or antibodies. Also, the PNAs can be 
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attached to water-soluble or water- insoluble polymers. In 
another aspect of the invention, the PNAs and bis PNAs are 
conjugated to oligonucleotides or carbohydrates. When 
warranted, a PNA or bis PNA can be synthesized attached to a 
5 further moiety (e.g., a peptide chain, reporter, intercalator 
or other type of ligand-containing group) that in turn is 
attached to a solid support. As with other PNAs of the 
invention, such PNA conjugates can be used for gene modulation 
(e.g., gene targeted drugs), for diagnostics, as biotechnology 
10 and research probes, primers, artificial restriction enzymes 
ar.d the like. 

As a further aspect of the invention, PNAs and bis 
PKAs can be used to target RNA and ssDNA to produce both 
complementary type gene regulating moieties and hybridization 

15 probes for the identification and purification of nucleic 
acids* Furthermore, the PNAs and bis PNAs can be modified in 
such a way that they can form triple helices with dsDNA. 
Reagents that bind sequence-specif ically to dsDNA have 
applications as gene targeted drugs. These are foreseen as 

20 extremely useful drugs for treating diseases like cancer, AIDS 
and other virus infections, and may also prove effective for 
treatment of some genetic diseases. Furthermore, these 
reagents may be used for research and in diagnostics for 
detection and isolation of specific nucleic acids. 

25 The triple helix principle is used in the art for 

sequence -specif ic recognition of dsDNA. Triple helix formation 
utilizes recognition of homopurine-homopyrimidine sequences. 
' A strand displacement complex with triple helix formation is 
superior to simple triple helix recognition in that strand 

30 displacement complexes are very stable at physiological 
conditions, that is, neutral pH, ambient (20-40 °C) temperature 
and medium (100-150 mM) ionic strength. 

Gene targeted drugs are designed with a nucleobase 
sequence (containing from about 10 to about 20 units) 

35 complementary to the regulatory region (the promoter) of the 
target gene. Therefore, upon administration of the drug, it 
binds to the promoter and blocks access thereto by RNA 
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polymerase. Consequently, no mRNA, and thus no gene product 
(protein), is produced. If the target is within a vital gene 
for a virus, no viable virus particles will be produced. 
Alternatively, the target could be downstream from the 
5 promoter, causing the RNA polymerase to terminate at this 
position, thus forming a truncated mRNA/protein which is 
nonfunctional . 

Sequence- specif ic recognition of ssDNA by base 
complementary hybridization can likewise be exploited to target 

10 specific genes and viruses. In this case, the target sequence 
is contained in the mRNA such that binding of the drug to the 
target hinders the action of ribosomes and, consequently, 
translation of the mRNA into protein. The bis PNAs of the 
invention are superior to prior reagents in that they have 

15 significantly higher affinity for complementary ssDNA. Also, 
they can be synthesized such that they possess no charge and 
are water soluble, which should facilitate cellular uptake, and 
they contain amides of non-biological amino acids, which should 
make them biostable and resistant to enzymatic degradation by, 

20 for example, proteases. 

The bis-PNAs and the C-pyrimidine and iso-pyrimidine 
nucleobase containing PNAs of the invention are particularly 
useful for diagnostic assays and molecular biological cloning 
ar.a sub-cloning techniques that can take advantage of the 

25 strand displacement effect that occurs upon binding of the bis- 
PNAs to double stranded DNA. Further they can also be 
advantageously used for transcription inhibition useful in 
diagnostic tests and for modification of PCR based assays since 
they exhibit a even greater base mismatch specificity than does 

30 normal PNA. 

Synthesis of monomer ic building blocks 

The monomeric building blocks of the present 
invention are composed of an amino acid or amino acid analog 
backbone portion and a nucleobase portion. A more generalized 
35 description would be a backbone with a carboxyl functional 
group, an amino functional group and at least one other 
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functional group e.g. a nucleobase or nucleobase analog. The 
monomeric building blocks are preferably synthesized by a 
general procedure that varies depending on the monomer being 
synthesized. This involves preparation of a backbone portion 
5 of the monomeric building block prior to the addition of the 
nucleobase and any tethered functional moieties, e.g. N{2- 
aminoethyl) glycine. Illustrative examples are described in 
Examples 1, 2, 7, 8 and 9. Next, the desired nucleobase or 
nucleobase analog is covalently bound to the backbone portion 

10 to give the monomeric building block. The synthesis of the 
thymine monomer is illustrated in Examples 3-6, and that of the 
protected cytosine monomer is illustrated in Example 9-17. 

The synthesis of the protected adenine monomer, as 
is illustrated in Examples 18-22, involved alkylation with 

15 ethyl bromoacetate and verification of the position of 
substitution by X-ray crystallography, as being the wanted 9- 
position. The N*-amino group then was protected with the 
benzyloxycarbonyl group by the use of the reagent N-ethyl-ben- 
zyloxycarbonylimidazole tetraf luoroborate. Simple hydrolysis 

20 of the product ester gave N 6 -benzyloxycarbonyl - 9 -carboxymethyl 
adenine, which then was used in the standard procedure. 

The synthesis of the protected G-monomer is 
illustrated in examples 23-25, The starting material, 2 -amino- 
6-chloropurine, was alkylated with bromoacetic acid and the 

25 chlorine atom was then substituted with a benzyloxy group. The 
resulting acid was coupled to (boc-aminoethyl) glycine methyl 
ester with agent PyBrop™, and the resulting ester was 
hydrolysed. The O 6 -benzyl group was removed in the final HF- 
cleavage step in the synthesis of the PNA-oligomer . Cleavage 

30 was verified by finding the expected mass of the final PNA- 
oligomer, upon incorporation into a PNA-oligomer using 
diisopropyl carbodiimide as the condensation agent. 

The synthesis of monomers having C-pyrimidine and 
iso-pyrimidine heterocyclic bases and their incorporation into 

35 PNAs and bis PNAs is illustrated in further of the examples. 
The replacement of the cytosines with pseudo isocytosines in 
the parallel strand of a bis PNA that contains an anti-parallel 
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strand has been shown to be stable in a range of pH's whereas 
the same bis PNA shows a pH dependence when cytosine is 
present. This effect is illustrated in Example 61. 

The synthesis of the pseudo isocytosine monomer is 
5 illustrated by Examples 26-32. The synthesis of other 
monomeric building blocks having either iso-cytosine, 5-bromo 
uracil, or pseudo uracil are illustrated by Examples 33-44. 

Synthesis of PKAs and bis PNAs 

Synthesis of PNAs and bis PNAs involve attachment of 

10 a first monomeric building block to a solid support. Next, 
elucidation of the desired PNA is achieved through an iterative 
process involving deprotecting and coupling. If the desired 
molecule is a bis PNA, a tether is incorporated in much the 
same manner as a monomeric building block is incorporated 

15 followed by another iterative process as above to elucidate the 
second PNA chain of desired sequence. 

The principle of anchoring molecules onto a solid 
matrix, which helps in accounting for intermediate products 
during chemical transformations, is known as Solid- Phase 

20 Synthesis or Merrifield Synthesis (see, e.g., Merrifield, J. 
Am. Chem. Soc. , 1963, 85, 2149 and Science, 1986, 232, 341). 
Established methods for the stepwise or fragmentwise solid- 
phase assembly of amino acids into peptides normally employ a 
beaded matrix of slightly cross-linked styrene-divinylbenzene 

25 copolymer, the cross -linked copolymer having been formed by the 
pearl polymerization of styrene monomer to which has been added 
a mixture of divinylbenzenes . A level of 1-2% cross-linking 
is usually employed. Such a matrix can also be used in solid- 
phase PNA synthesis in accordance with the present invention 

30 (Figure 8) . 

Concerning the initial f unctionalization of the solid 
phase, more than fifty methods have been described in 
connection with traditional solid-phase peptide synthesis (see, 
e.g., Barany and Merrifield in "The Peptides" Vol. 2, Academic 
35 Press, New York, 1979, pp. 1-284, and Stewart and Young, "Solid 
Phase Peptide Synthesis", 2nd Ed., Pierce Chemical Company, 
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Illinois, 1984). Reactions for the introduction of 
chloromethyl functionality (Merrifield resin; via a 
chloromethyl methyl ether/SnCl« reaction) . aminomethyl 
functionality (via an N-hydroxymethylphthalimide reaction; see. 
Mitchell, et al.. Tetrahedron Lett.. 1976, 3795). and 
benzhydrylamino functionality (Pietta, et al., J. Chem. Soc, 
1970, 650) are the most widely applied. Regardless of its 
nature, the purpose of the functionality is normally to form 
an anchoring linkage between the copolymer solid support and 
, the C-terminus of the first monomeric building block to be 
coupled to the solid support. As will be recognized, anchoring 
linkages also can be formed between the solid support and the 
N-tenninuB of the monomeric building block. It is generally 
convenient to express the .-concentration" of a functional group 
3 ir terms of millimoles per gram (mmol/g) . Other reactive 
functionalities which have been initially introduced include 
4-methylbenzhydrylamino and 4-methoxybenzhydrylamino. All of 
these established methods are in principle useful within the 
context of the present invention. Preferred methods for PNA 
0 synthesis employ aminomethyl as the initial functionality, in 
that aminomethyl is particularly advantageous with respect to 
fn e incorporation of "spacer" or "handle" groups, owing to the 
reactivity of the amino group of the aminomethyl functionality 
vrth respect to the essentially quantitative formation of amide 
•5 bonds to a carboxylic acid group at one end of the spacer- 
forming reagent. A vast number of relevant spacer- or handle- 
forming bifunctional reagents have been described (see, Barany, 
e- al int. J. Peptide Protein Res., 1987, 30. 705), 
esoeciaily reagents which are reactive towards amino groups 
30 such as found in the aminomethyl function. Representative 
b- functional reagents include 4- (haloalkyl) aryl-lower alkanoic 
a^ids such as 4- (bromomethyl) phenylacetic acid, Boc-aminoacyl- 
a- (oxymethyl) aryl-lower alkanoic acids such as Boc-aminoacyl-4- 
ioxymethyDphenylacetic acid, N-Boc-p-acylbenzhydrylamines such 
35 as N-Boc- P -glutaroylbenzhydrylamine, N-Boc-4 ' -lower alkyl-p- 
acylbenzhydrylamines such as N-Boc - 4 < -methyl -p- 
o-utaroylbenzhydrylamine, N-Boc-4 • -lower alkoxy-p-acylbenz- 
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hydryl amines such as N-Boc-4 # -methoxy-p-glutaroylbenzhy- 
drylamine, and 4-hydroxymethylphenoxyacetic acid. One type of 
spacer group particularly relevant within the context of the 
present invention is the phenylacetamidomethyl (Pam) handle 
5 (Mitchell and Merrif ield, J. Org. Chem., 1976, 41, 2015) which, 
deriving from the electron withdrawing effect of the 4- 
phenylacetamidomethyl group, is about 100 times more stable 
than the classical benzyl ester linkage towards the Boc-amino 
deprotection reagent trif luoroacetic acid (TFA) . 

10 Certain functionalities (e.g., benzhydryl amino, 4- 

methylbenzhydryl amino and 4-methoxybenzhydrylamino) which may 
be incorporated for the purpose of cleavage of. a synthesized 
PNA or bis PNA chain from the solid support such that the C- 
terminal of the PNA or bis PNA chain is in amide form, not 

15 requiring the introduction of a spacer group. Any such 
functionality may advantageously be employed in the context of 
the present invention. 

An alternative strategy concerning the introduction 
of spacer or handle groups is the so-called "preformed handle" 

20 strategy (see, Tarn, et al., Synthesis, 1919, 955-957), which 
offers complete control over coupling of the first monomer ic 
building block, and excludes the possibility of complications 
arising from the presence of undesired functional groups not 
related to the PNA synthesis. In this strategy, spacer or 

25 handle groups, of the same type as described above, are reacted 
with the first monomeric building block desired to be bound to 
the solid support, the monomeric building block being N-protec- 
' ted and optionally protected at the other side-chains which are 
not relevant with respect to the growth of the desired PNA 

3 0 chain. Thus, in those cases in which a spacer or handle group 
is desirable, the first monomeric building block to be coupled 
to the solid support can either be coupled to the free reactive 
end of a spacer group which has been bound to the initially 
introduced functionality (for example, an aminomethyl group) 

3 5 or can be reacted with the spacer- forming reagent. The space - 
forming reagent is then reacted with the initially introduced 
functionality. Other useful anchoring schemes include the 



WO 96/01558 



PCT/US95/09084 



- 25 - 

"multidetachable" resins (Tarn, etal., Tetrahedron Lett . t 1979, 
4935 and J. Am. Chem. Soc, 1980, 102, 611; Tarn, J\ Org. Chem., 
1985, 50, 5291), which provide more than one mode of release 
and thereby allow more flexibility in synthetic design. 
5 Suitable choices for N-protection are the tert- 

butyloxycarbonyl (Boc) group (Carpino, J. Am. Chem. 5oc, 1957, 
79, 4427; McKay, et al., J. Am. Chem. Soc, 1957, 79, 4686; 
Anderson, et al., J. Am. Chem. Soc, 1957, 79, 6180) normally 
in combination with benzyl-based groups for the protection of 

10 side chains, and the 9-f luorenylmethyloxycarbonyl (Fmoc) group 
(Carpino, et al., J. Am. Chem. Soc, 1970, 92, 5748 and J. Org. 
Chem., 1972, 37, 3404) , normally in combination with tert-butyl 
(tBu) for the protection of any side chains, although a number 
of other possibilities exist which are well known in 

15 conventional solid-phase peptide synthesis. 

Thus, a wide range of other useful amino protecting 
groups exist, some of which are Adoc {Hass, et al., J. Am* 
Chem. Soc, 1966, 88, 1988), Bpoc (Sieber, Helv. Chem. Acta., 
1968, 51, 614), Mcb (Brady, et a2., J. Org. Chem., 1977, 42, 

20 143), Bic (Kemp, et al., Tetrahedron, 1975, 4624), the o- 
nitrophenylsulfenyl (Nps) (Zervas, et ah, J. Am. Chem. Soc, 
1963, 85, 3660), and the dithiasuccinoyl (Dts) (Barany, etal., 
J. Am. Chem. Soc, 1977, 99, 7363). These amino protecting 
groups, particularly those based on the widely-used urethane 

25 functionality, successfully prohibit racemization (mediated by 
tautomerization of the readily formed oxazolinone (azlactone) 
intermediates (Goodman, et ai., J. Am. Chem. Soc, 1964, 86, 
2918) during the coupling of most a-amino acids. In addition 
to such amino protecting groups, a whole range of nonurethane- 

30 type of amino protecting groups are applicable when assembling 
PNA molecules, especially those built from achiral units. 
Thus, not only the above-mentioned amino protecting groups (or 
those derived from any of these groups) are useful within the 
context of the present invention, but virtually any amino 

35 protecting group which largely fulfills the following 
requirements: (1) stability to mild acids (not significantly 
attacked by carboxyl groups) ; (2) stability to mild bases or 
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nucleophiles (not significantly attacked by the amino group in 
question); (3) resistance to acylation (not significantly 
attacked by activated amino acids or activated monomeric 
building blocks) . Additionally: (4) the protecting group must 
5 be close to quantitatively removable, without serious side 
reactions, and (5) the optical integrity, if any, of the 
incoming monomeric building block should preferably be highly 
preserved upon coupling. Finally, the choice of side -chain 
protecting groups, in general, depends on the choice of the 

10 amino protecting group, since the protection of side-chain 
functionalities must withstand the conditions of the repeated 
amino deprotection cycles. This is true whether the overall 
strategy for chemically assembling PNA or bis PNA molecules 
relies on, for example, differential acid stability of amino 

15 and side-chain protecting groups (such as is the case for the 
above-mentioned "Boc- benzyl" approach) or employs an 
orthogonal, that is, chemoselective, protection scheme (such 
as is the case for the above-mentioned "Fmoc-tBu" approach) , 

Following coupling of the first monomeric building 

20 block, the next stage of solid-phase synthesis is the 
systematic elaboration of the desired PNA chain. This 
elaboration involves repeated deprotection/coupling cycles. 
The temporary protecting group, such as a Boc or Fmoc group, 
on the last -coupled monomeric building block is quantitatively 

25 removed by a suitable treatment, for example, by acidolysis, 
such as with trif luoroacetic acid, in the case of Boc, or by 
base treatment, such as with piperidine, in the case of Fmoc, 
' so as to liberate the N-terminal amine function. 

The next desired N-protected monomeric building block 

3 0 is then coupled to the N-terminal of the last -coupled monomeric 
building block. This coupling of the C- terminal of a monomeric 
building block with the N-terminal of the last-coupled 
monomeric building block can be achieved in several ways . For 
example, it can be bound by providing the incoming monomeric 

3 5 building block in a form with the carboxyl group activated by 
any of several methods, including the initial formation of an 
active ester derivative such as a 2 , 4 , 5- trichlorophenyl ester 
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(Piess, et al., Helv. Chim. Acta, 1963, 46, 1609), a 
phthalimido ester (Nefkens, et al., J. Am. Chem. Soc, 1961, 
83 , 1263), a pentachlorophenyl ester (Kuprys2ewski, Rocz. 
Chem., 1961, 35, 595), a pentaf luorophenyl ester (Kovacs, et 
5 al., J. Am. Chem. Soc, 1963, 85, 183), an o-nitrophenyl ester 
(Bodanzsky, Nature, 1955, 175, 685), an imidazole ester {Li, 
et al., J. Am. Chem. Soc, 1970, 92, 7608), and a 3-hydroxy-4- 
oxo-3 , 4-dihydroquinazoline (Dhbt-OH) ester (Konig, et al., 
Chem. Ber. , 1973, 103, 2024 and 2034), or the initial formation 

10 of an anhydride such as a symmetrical anhydride (Wieland, et 
al,, Angew. Chem., Int. Ed. Engl., 1971, 10, 336). 
Alternatively, the carboxyl group of the incoming monomeric 
building block can be reacted directly with the N-terminal of 
the last -coupled monomeric building block with the assistance 

15 of a condensation reagent such as, for example, 
dicyclohexylcarbodiimide (Sheehan, et al.', J. Am. Chem. Soc, 
1955, 77, 1067) or derivatives thereof. Benzotriazolyl N-oxy- 
trisdimethylaminophosphonium hexaf luorophosphate (BOP) , 
"Castro's reagent" (see, e.g., Rivaille, et al . , Tetrahedron, 

20 1980, 36, 3413) is recommended when assembling PNA or bis PNA 
molecules containing secondary amino groups. Finally, 
activated PNA monomers analogous to the recently-reported amino 
acid fluorides (Carpino, J. Am. Chem. Soc, 1990, 112, 9651) 
hold considerable promise to be used in PNA and bis PNA 

25 svr.thesis as well. 

The synthesis of a bis PNA from a PNA chain attached 
tc the solid support is similar to the iterative process that 
* is used to synthesize the PNA chain* The last desired 
mcr.omeric building block is coupled and the gel is washed with 

3 0 a suitable solvent e.g. pyridine. The terminal N protecting 
group is removed and an activated linking segment is coupled. 
The linking segment may be a single unit or as is the case with 
the ethyleneglycol or aminohexanoic acid type linking segments 
(Examples 47 and 55) the linking segment is added in sub units 

35 which, when coupled together will give the desired linking 
segment. Synthesis of the second segment of PNA is effected 
as per the first segment. 
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Following assembly of the desired PNA or bis PNA 
chain, including protecting groups, the next step will normally 
be deprotection of the coupled building blocks of the PNA or 
bis PNA chain and cleavage of the synthesized PNA or bis PNA 
5 from the solid support. These processes can take place 
substantially simultaneously, thereby providing the free PNA 
or bis PNA molecule in the desired form. Alternatively, in 
cases in which condensation of two separately synthesized PNA 
chains is to be carried out, it is possible by choosing a 

10 suitable spacer group at the start of the synthesis to cleave 
the desired PNA or bis PNA chains from their respective solid 
supports (both peptide chains still incorporating their side- 
chain protecting groups) and finally removing the side -chain 
protecting groups after, for example, coupling the two side- 

15 chain protected peptide chains to form a longer PNA or bis PNA 
chain. 

In the above-mentioned "Boc-benzyl" protection 
scheme, the final deprotection of side-chains and release of 
the PNA or bis PNA molecule from the solid support is most 

20 often carried out by the use of strong acids such as anhydrous 
HF (Sakakibara, etal., Bull. Chew. Soc. Jpn., 1965, 38, 4921), 
boron tr is (trif luoroacetate) (Pless, etal., Helv. Chim. Acta, 
1973, 46, 1609), and sulfonic acids such as trif luoromethane- 
sulfonic acid and methanesulfonic acid (Yajima, et al., J. 

25 Chem. Soc, Chem. Comm., 1974, 107). This conventional strong 
acid (e.g., anhydrous HF) deprotection method, produces very 
reactive carbocations that may lead to alkylation and acylation 
cf sensitive residues in the PNA chain. Such side-reactions 
are only partly avoided by the presence of scavengers such as 

30 ar.isole, phenol, dimethyl sulfide, and mercaptoethanol and, 
therefore, the sulf ide-assisted acidolytic S M 2 deprotection 
method (Tarn, et al . , J. Am. Chem. Soc, 1983, 105, 6442 and J. 
Am. Chem. Soc, 1986, 108, 5242), the so-called "low", which 
removes the precursors of harmful carbocations to form inert 

35 sulfonium salts, is frequently employed in peptide and PNA 
synthesis, either solely or in combination with "high" methods. 
Less frequently, in special cases, other methods used for 
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deprotection and/or final cleavage of the PNA-solid support 
bond are, for example, such methods as base -catalyzed al- 
coholysis {Barton, et al . , J. Am. Chem. Soc*, 1973, 95, 4501), 
and ammono lysis as well as hydrazinolysis (Bodanszky, et al., 
5 Chem. Ind. , 1964 1423), hydrogenolysis (Jones, Tetrahedron 
Lett,, 1977 2853 and Schlatter, et al-, Tetrahedron Lett., 
1977, 2861), and photolysis (Rich and Gurwara, J. Am. Chem. 
Soc, 1975, 97, 1575) . 

Finally, in contrast with the chemical synthesis of 

10 "normal " peptides, stepwise chain building of achiral PNAs and 
bis PNAs such as those based on aminoethylglycyl backbone units 
can start either from the N- terminus or the C- terminus, because 
the coupling reactions are free of racemization. Those skilled 
in the art will recognize that whereas syntheses commencing at 

15 the C- terminus typically employ protected amine groups and free 
or activated acid groups, syntheses commencing at the N- 
terminus typically employ protected acid groups and free or 
activated amine groups. 

Based on the recognition that most operations are 

20 identical in the synthetic cycles of solid-phase peptide 
synthesis (as is also the case for solid-phase PNA and bis PNA 
synthesis) , a new matrix, PEPS, was recently introduced (Berg, 
et al., J. Am. Chem. Soc, 1989, 111, 8024 and International 
Patent Application WO 90/02749) to facilitate the preparation 

25 of large numbers of peptides. This matrix is comprised of a 
polyethylene (PE) film with pendant long -chain polystyrene (PS) 
grafts (molecular weight on the order of 10 6 ) , The loading 
capacity of the film is as high as that of a beaded matrix, but 
PEPS has the additional flexibility to suit multiple syntheses 

30 simultaneously. Thus, in a new configuration for solid-phase 
peptide synthesis, the PEPS film is fashioned in the form of 
discrete, labeled sheets, each serving as an individual 
compartment. During all the identical steps of the synthetic 
cycles, the sheets are kept together in a single reaction 

35 vessel to permit concurrent preparation of a multitude of 
peptides at a rate close to that of a single peptide by 
conventional methods. It was reasoned that the PEPS film 
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support, comprising linker or spacer groups adapted to the 
particular chemistry in question, should be particularly 
valuable in the synthesis of multiple PNA and bis PNA 
molecules, these being conceptually simple to synthesize since 
5 only four different reaction compartments are normally 
required, one for each of the four "pseudo- nucleotide" units. 
Thus, the PEPS film support has been successfully tested in a 
number of PNA syntheses carried out in a parallel and 
substantially simultaneous fashion. The yield and quality of 

10 the products obtained from PEPS were comparable to those 
obtained by using the traditional polystyrene beaded support. 
Also, experiments with other geometries of the PEPS polymer 
such as, for example, non- woven felt, knitted net, sticks or 
microwell plates have not indicated any limitations of the 

15 synthetic efficacy. 

Two other methods proposed for the simultaneous 
synthesis of large numbers of peptides also apply to the prepa- 
ration of multiple, different PNA molecules. The first of 
these methods (Geysen, et al., Proc. Natl. Acad. Sci. USA, 

20 1984, 82, 3998) utilizes acrylic acid-grafted polyethylene-rods 
and 96-microtiter wells to immobilize the growing peptide 
chains and to perform the compartmentalized synthesis. While 
highly effective, the method is only applicable on a microgram 
scale. The second method (Houghten, Proc. Natl. Acad. Sci. 

25 USA, 1985, 82, 5131) utilizes a "tea bag" containing 
traditionally-used polymer beads. Other relevant proposals for 
multiple peptide, PNA of bis PNA synthesis in the context of 
the present invention include the simultaneous use of two 
different supports with different densities (Tregear, in 

30 " Chemistry and Biology of Peptides", J. Meienhofer, ed. , Ann 
Arbor Sci. Publ . , Ann Arbor, 1972 pp. 175-178), combining of 
reaction vessels via a manifold (Gorman, Anal. Biochem. , 1984, 
136, 397), multicolumn solid-phase synthesis (e.g. Krchnak, et 
al., Int. J. Peptide Protein Res . , 1989, 33, 209), and Holm and 

3 5 Meldal, in " Proceedings of the 20th European Peptide 
Symposium", G. Jung and E. Bayer, eds., Walter de Gruyter & 
Co., Berlin, 1989, 208-210), and the use of cellulose paper 
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(Eichler, et al. f Collect. Czech. Chem. Comimm., 1989, 54, 
1746), and U.S. Patent 5,324,483 issued June 28, 1994. 

While the conventional cross -linked s tyrene/di vinyl - 
benzene copolymer matrix and the PEPS support are presently 
5 preferred in the context of solid-phase PNA and bis PNA 
synthesis, a non- limiting list of examples of solid supports 
which may be of relevance are: (1) particles based upon 
copolymers of dimethylacrylamide cross-linked with N,N'- 
bisacryloylethylenediamine, including a known amount of N- 

10 tertbutoxycarbonyl-beta-alanyl-N' -acryloylhexamethylenediamine . 
Several spacer molecules are typically added via the beta 
alanyl group, followed thereafter by the amino acid residue 
summits. Also, the beta alanyl containing monomer can be 
replaced with an acryloyl sarcosine monomer during 

15 polymerization to form resin beads. The polymerization is 
followed by reaction of the beads with ethylenediamine to form 
resin particles that contain primary amines as the covalently 
linked functionality. The polyacrylamide-based supports are 
relatively more hydrophilic than are the polystyrene -based 

20 supports and are usually used with polar aprotic solvents 
including dimethylf ormamide, dimethylacetamide, N- 
methylpyrrolidone and the like (see Atherton, et al., <J. Am. 
Chem. Soc, 1975, 97, 6584, Bioorg. Chem. 1979, 8, 351), and 
CC.S. Perkin 1 538 (1981)); (2) a second group of solid 

25 supports is based on silica-containing particles such as porous 
glass beads and silica gel. One example is the reaction 
product of trichloro- [3- (4-chloromethyl) phenyl] propylsilane and 
' porous glass beads (see Parr and Grohmann, Angew. Chem. 
Internal. Ed. 1972, 11, 314) sold under the trademark "PORASIL 

30 E" by Waters Associates, Framingham, MA, USA. Similarly, a 
mono ester of 1 , 4-dihydroxyme thy 1 benzene and silica (sold under 
the trademark "BIOPAK" by Waters Associates) has been reported 
to be useful (see Bayer and Jung, Tetrahedron Lett., 1970, 
4 503) ; (3) a third general type of useful solid supports can 

35 be termed composites in that they contain two major 
ingredients: a resin and another material that is also 
substantially inert to the organic synthesis reaction 
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conditions employed. One exemplary composite (see Scott, et 
al., J. Chrom. Sci., 1971, 9, 577) utilized glass particles 
coated with a hydrophobic, cross -linked styrene polymer 
containing reactive chloromethyl groups, and was supplied by 
5 Northgate Laboratories, Inc., of Hamden, CT, USA. Another 
exemplary composite contains a core of fluorinated ethylene 
polymer onto which has been grafted polystyrene (see Kent and 
Merrifield, Israel J. Chem. 1978, 17, 243) and van Rietschoten 
in "Peptides 2974", Y. Wolman, Ed., Wiley and Sons, New York, 
10 1975, pp. 113-116); and (4) contiguous solid supports other 
than PEPS, such as cotton sheets (Lebl and Eichler, Peptide 
Res. 1989, 2, 232) and hydroxypropylacrylate- coated 
polypropylene membranes (Daniels, et al., Tetrahedron Lett, 
1989, 4345), are suited for PNA and bis PNA synthesis as well. 
15 Whether manually or automatically operated, solid- 

phase PNA and bis PNA synthesis in the context of the present 
invention is normally performed batchwise. However, most of 
the syntheses may equally well be carried out in the con- 
tinuous-flow mode, where the support is packed into columns 
20 (Bayer, et al . , Tetrahedron Lett., 1970, 4503 and Scott, et 
al., J* Chromatogr. Sci., 1971, 9, 577). With respect to con- 
tinuous-flow solid-phase synthesis, the rigid poly (dimethyl - 
acrylamide) -Kieselguhr support (Atherton, et al., J. Chem. Soc. 
Chem. Commun., 1981, 1151) appears to be particularly 
25 successful, but another valuable configuration concerns the 
one worked out for the standard copoly (styrene -1% - 
divinylbenzene) support (Krchnak, et al., Tetrahedron Lett., 
1987, 4469) . 

While the solid-phase technique is presently 
3 0 preferred in the context of PNA and bis PNA synthesis, other 
methodologies or combinations thereof, for example, in 
combination with the solid-phase technique, apply as well: (1) 
the classical solution-phase methods for peptide synthesis 
(e.g., Bodanszky, "Principles of Peptide Synthesis" , Springer- 
35 Verlag, Berlin-New York 1984) , either by stepwise assembly or 
by segment/fragment condensation, are of particular relevance 
when considering especially large scale productions (gram, 
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kilogram) of PNA or bis PNA compounds; (2) the so-called 
"liquid-phase" strategy, which utilizes soluble polymeric 
supports such as linear polystyrene (Shemyakin, et al . , 
Tetrahedron Lett., 1965, 2323) and polyethylene glycol (PEG) 
5 (Mutter and Bayer, Angew. Chem., Int. Ed, Engl., 1974, 13, 88), 
is useful; (3) random polymerization {see, e.g., Odian, "Prin- 
ci pi es of Polymer i za fci on " , McGraw -Hill, New York (1970)) 
yielding mixtures of many molecular weights ( "polydisperse") 
peptide, PNA or bis PNA molecules are particularly relevant for 

10 purposes such as screening for antiviral effects; (4) a 
technique based on the use of polymer -supported amino acid 
active esters (Fridkin, et al., J. Am. Chem. Soc, t 1965, 87, 
4646), sometimes referred to as "inverse Merrifield synthesis" 
or "polymeric reagent synthesis", offers the advantage of 

15 isolation and purification of intermediate products, and may 
thus provide a particularly suitable method for the synthesis 
of medium-sized, optionally protected, PNA or bis PNA 
molecules, that can subsequently be used for fragment 
condensation into larger PNA or bis PNA molecules; (5) it is 

20 envisaged that PNA molecules may be assembled enzymatically by 
enzymes such as proteases or derivatives thereof with novel 
specificities {obtained, for example, by artificial means such 
as protein engineering) . Also, one can envision the 
development of "PNA ligases" for the condensation of a number 

25 of PNA fragments into very large PNA or bis PNA molecules; (6) 
since antibodies can be generated to virtually any molecule of 
interest, the recently developed catalytic antibodies 
(abzymes) , discovered simultaneously by the groups of Lerner 
(Tramantano, et al., Science, 1986, 234, 1566) and of Schultz 

30 (Pollack, et al * , Science, 1986, 234, 1570), should also be 
considered as potential candidates for assembling PNA and bis 
PNA molecules. Thus, there has been considerable success in 
producing abzymes catalyzing acyl- transfer reactions (see for 
example Shokat, et al.. Nature, 1989, 338, 269) (and references 

35 therein) . Finally, completely artificial enzymes, very 
recently pioneered by Stewart's group {Hahn, et al., Science, 
1990, 248, 1544), may be developed to suit PNA synthesis. The 
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design of generally applicable enzymes, ligases, and catalytic 
antibodies, capable of mediating specific coupling reactions, 
should be more readily achieved for PNA synthesis than for 
"normal" peptide synthesis since PNA molecules will often be 
5 comprised of only four different amino acids (one for each of 
the four native nucleobases) as compared to the twenty natural 
by occurring (proteinogenic) amino acids constituting peptides . 
In conclusion, no single strategy may be wholly suitable for 
the synthesis of a specific PNA or bis PNA molecule, and 
10 therefore, sometimes a combination of methods may work best. 

The present invention also is directed to therapeutic 
or prophylactic uses for PNAs and bis PNAs. Likely therapeutic 
and prophylactic targets include herpes simplex virus (HSV) , 
human papillomavirus (HPV) , human immunodeficiency virus (HIV) , 
15 candidia albicans, influenza virus, cytomegalovirus (CMV) , 
intracellular adhesion molecules (ICAM) , 5 -lipoxygenase (5-LO) , 
phospholipase A 2 (PLA 2 ) , protein kinase C (PKC) , and RAS 
oncogene. Potential applications of such targeting include 
treatments for ocular, labial, genital, and systemic herpes 
20 simplex I and II infections; genital warts; cervical cancer; 
common warts; Kaposi's sarcoma; AIDS; skin and systemic fungal 
infections; flu; pneumonia; retinitis and pneumonitis in 
immunosuppressed patients; mononucleosis; ocular, skin and 
systemic inflammation; cardiovascular disease; cancer; asthma ; 
2 5 psoriasis; cardiovascular collapse; cardiac infarction; 
gastrointestinal disease; kidney disease; rheumatoid arthritis; 
osteoarthritis; acute pancreatitis; septic shock; Crohn's 
disease; and bacterial infections. 

For therapeutic or prophylactic treatment, the PNAs 
30 and bis PNAs of the invention can be formulated in a pharmaceu- 
tical composition, which may include carriers, thickeners, 
diluents, buffers, preservatives, surface active agents and the 
like. Pharmaceutical compositions may also include one or more 
active ingredients such as antimicrobial agents, anti- 
35 inflammatory agents, anesthetics, and the like in addition to 
PNA or bis PNA. 
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The pharmaceutical composition may be administered 
in a number of ways depending on whether local or systemic 
treatment is desired, and on the area to be treated. 
Administration may be done topically (including ophthalmically, 
5 vaginally, rectally, intranasally) , orally, by inhalation, or 
parenterally, for example by intravenous drip or subcutaneous, 
intraperitoneal or intramuscular injection* 

Formulations for topical administration may include 
ointments, lotions, creams, gels, drops, suppositories, sprays, 
10 liquids and powders. Conventional pharmaceutical carriers, 
aqueous, powder or oily bases, thickeners and the like may be 
necessary or desirable. Coated condoms may also be useful. 

Compositions for oral administration include powders 
or granules, suspensions or solutions in water or non -aqueous 
15 media, capsules, sachets, or tablets. Thickeners, flavorings, 
diluents, emulsifiers, dispersing aids or binders may be 
desirable. 

Formulations for parenteral administration may 
include sterile aqueous solutions which may also contain 

20 buffers, diluents and other suitable additives. 

Dosing is dependent on severity and responsiveness 
of the condition to be treated, but will normally be one or 
more doses per day, with course of treatment lasting from 
several days to several months or until a cure is effected or 

25 a diminution of disease state is achieved. Persons of ordinary 
skill can easily determine optimum dosages, dosing 
methodologies and repetition rates. 

Treatments of this type can be practiced on a variety 
of organisms ranging from unicellular prokaryotic and eukaryo- 

30 tic organisms to multicellular eukaryotic organisms. Any 
organism that utilizes DNA-RNA transcription or RNA-protein 
translation as a fundamental part of its hereditary, metabolic 
or cellular control is susceptible to therapeutic and/or 
prophylactic treatment in accordance with the invention, 

35 Seemingly diverse organisms such as bacteria, yeast, protozoa, 
algae, all plants and all higher animal forms, including warm- 
blooded animals, can be treated. Further, since each cell of 
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multicellular eukaryotes can be treated since they include both 
DNA-RNA transcription and RNA-protein translation as integral 
parts of their cellular activity- Furthermore, many of the 
organelles (e.g., mitochondria and chloroplasts) of eukaryotic 
5 cells also include transcription and translation mechanisms. 
Thus, single cells, cellular populations or organelles can also 
be included within the definition of organisms that can be 
treated with therapeutic or diagnostic PNAs or bis PNAs. As 
used herein, therapeutics is meant to include the eradication 

10 of a disease state, by killing an organism or by control of 
erratic or harmful cellular growth or expression. 

The present invention also pertains to the 
advantageous use of PNA and bis PNA molecules in solid-phase 
biochemistry (see, e.g., "Solid-Phase Biochemistry ~ Analytical 

15 and Synthetic Aspects", W. H. Scouten, ed., John Wiley & Sons, 
New York, 1983) , notably solid-phase biosystems, especially 
bioassays or solid-phase techniques which concerns diagnostic 
detection/quantitation or affinity purification of 
complementary nucleic acids (see, e.g., "Affinity 

20 Chromatography - A Practical Approach", P. D. G. Dean, W. S. 
Johnson and F. A. Middle, eds., IRL Press Ltd., Oxford 1986; 
"Nucleic Acid Hybridization - A Practical Approach", B. D. 
Harnes and S. J. Higgins, IRL Press Ltd., Oxford 1987). 
Present day methods for performing such bioassays or 

25 purification techniques almost exclusively utilize "normal" or 
slightly modified oligonucleotides either physically adsorbed 
or bound through a substantially permanent covalent anchoring 
linkage to beaded solid supports such as cellulose, glass 
beads, including those with controlled porosity (Mizutani, et 

30 al., J. Chromatogr. t 1986, 356, 202), "Sephadex", "Sepharose" , 
agarose, polyacrylamide, porous particulate alumina, 
hydroxyalkyl methacrylate gels, diol -bonded silica, porous 
ceramics, or contiguous materials such as filter discs of nylon 
and nitrocellulose. One example employed the chemical 

35 synthesis of oligo-dT on cellulose beads for the affinity 
isolation of poly A tail containing mRNA (Gilham in "Methods 
in Enzymology," L. Grossmann and K. Moldave, eds., vol. 21, 
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part D, page 191, Academic Press, New York and London, 1971) . 
All the above-mentioned methods are applicable within the 
context of the present invention. However, when possible, 
covalent linkage is preferred over the physical adsorption of 
5 the molecules in question, since the latter approach has the 
disadvantage that some of the immobilized molecules can be 
washed out (desorbed) during the hybridi2ation or affinity 
process. There is, thus, little control of the extent to which 
a species adsorbed on the surface of the support material is 

10 lost during the various treatments to which the support is 
subjected in the course of the bioassay /purification procedure. 
The severity of this problem will, of course, depend to a large 
extent on the rate at which equilibrium between adsorbed and 
"free" species is established. In certain cases it may be 

15 virtually impossible to perform a quantitative assay with 
acceptable accuracy and/or reproducibility. Loss of adsorbed 
species during treatment of the support with body fluids, 
aqueous reagents or washing media will, in general, be expected 
to be most pronounced for species of relatively low molecular 

20 weight. In contrast with oligonucleotides, PNA and bis PNA 
molecules are easier to attach onto solid supports because they 
contain strong nucleophilic and/or electrophilic centers. In 
addition, the direct assembly of oligonucleotides onto solid 
supports suffers from an extremely low loading of the 

25 immobilized molecule, mainly due to the low surface capacity 
of the materials that allow the successful use of the state-of- 
the-art phosphoramidite chemistry for the construction of 
* oligonucleotides. (Beaucage and Caruthers, Tetrahedron Lett., 
1981, 22, 1859; Caruthers, Science, 1985, 232, 281). It also 

30 suffers from the fact that by using the alternative phosphite 
triester method (Letsinger and Mahadevan, J. Am. Chem. Soc. 
1976, 98, 3655) , which is suited for solid supports with a high 
surface/loading capacity, only relatively short oligo- 
nucleotides can be obtained. As for conventional solid-phase 

3 5 peptide synthesis, however, the latter supports are excellent 
materials for building up immobilized PNA and bis PNA molecules 
(the side-chain protecting groups are removed from the 
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synthesized PNA or bis PNA chain without cleaving the anchoring 
linkage holding the chain to the solid support) . Thus, PNA 
species benefit from the above -described solid-phase techniques 
with respect to the much higher (and still sequence- specif ic) 
5 binding affinity for complementary nucleic acids and from the 
additional unique sequence-specific recognition of (and strong 
binding to) nucleic acids present in double -stranded 
structures. They also can be loaded onto solid supports in 
large amounts, thus further increasing the sensitivity/capacity 

10 of the solid-phase technique- Further, certain types of 
studies concerning the use of PNA in solid-phase biochemistry 
can be approached, facilitated, or greatly accelerated by use 
of the recently- reported "light -directed, spatially 
addressable, parallel chemical synthesis" technology (Fodor, 

15 et al., Science, 1991, 251, 767), a technique that combines 
solid-phase chemistry and photolithography to produce thousands 
of highly diverse, but identifiable, permanently immobilized 
compounds (such as peptides) in a substantially simultaneous 
way. 

20 Additional objects, advantages, and novel features 

of this invention will become apparent to those skilled in the 
art upon examination of the following examples thereof, which 
are not intended to be limiting. 
General Remarks 

25 The following abbreviations are used in the 

experimental examples: egl, -NH-CH 2 -CH 2 -0-CH 2 -CH 2 -0-CH 2 -C (=0) - ; 
Aha, 6-amino hexanoic acid; DMF, N,N-dimethylf ormamide; DCC # 
N,N-dicyclohexyl carbodiimide; DCU, N, N-dicyclohexyl urea; THF, 
tetrahydrofuran; aeg, (2' -aminoethyl) glycine ; pfp, 

3 0 pentaf luorophenyl; Boc, tert-butoxycarbonyl ; Z, benzyloxy- 
carbonyl; NMR, nuclear magnetic resonance; s, singlet; d, 
doublet; dd, doublet of doublets; t; triplet; q, quartet; m, 
multiplet; b, broad; 6, chemical shift; 

NMR spectra were recorded on either a JEOL FX 90Q 

3 5 spectrometer, or a Bruker 250 MHz with tetramethylsilane as 
internal standard. Mass spectrometry was performed on a 
MassLab VG 12-250 quadropole instrument fitted with a VG FAB 
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15 



20 



* , ft K B Meltinq points were recorded on Buchi 
source and probe. Melting 

me- ting point apparatus and are uncorrected. N.N 
rae-tiny *> molecular sieves, 

Diethylamide was dried over 4 A 

distilled and stored over 4 A molecular sieves. Pyria 

qullty) was dried and stored over 4 A molecular sieves. Other 

Z t nls used were either the highest suality obtain^ or 

„e-e distilled before use . Dioxane was passed through basic 

alumina prior to use. ^^^^^l^^' 
ni-.rophenol. methyl bromoacetate . benzyloxycarbonyl *lori^e 

pentafluorophenol were all obtained through 
Lpany. Thymine, cytosine. adenine were all obtained through 

Si3ma ' v — ^orrr-anhv (Tic) was performed using 

Thin layer chromatograpny «° * 

i ~- cvet-ems- (1) chloroform rtriethyl 
th» following solvent systems. \±i 

"■ m «hh a nol 7-1-2; (2) methylene chloride methanol. 9:1; 
air.me: methanol, 7.1.2, * t were 

(3; chloroform methanol -.acetic acid B5.10.5. Sp 
visualized by UV (254 nm) or/and spraying with a n nhyd * 
solution (3 g ninhydrin in 1000 ml l-butanol and 30 ml acetic 
solution k 9 5 min and , aft er spraying, 

acid , after heating at 1.0 - ■ . Uca 
heating again. Tic plates were glass or plastic backed silica 
gel containing a fluorescent indicator. 

EXAMPLE 1 

tert-Butyl 4 -nitrophenyl carbonate (1) 

sodium carbonate (29.14 g; 0.275 mol) and 4 

ni ,rophenol (12.75 g; 91.6 — » ^V"'^" 
.v . Boc-anhydride (20.0 «, 91.6 mmol) was transferred to the 
• :: xt ure with dioxane ,50 ml, . The mixture was re fluxed for 1 
h , cooled to 0-C. filtered and concentrated to ^ ' *f ^" 
p-ured into water (350 ml) at 0-C. After stirring for 1/2 h 
, t- product was collected by filtration, washed with ^water. and 
t-n dried over sicapent. in vacuo. Yield 21.3 , 
n-. 0-74. 5-C (litt. 78. 5-79.5-0, . Anal. f-W* found(calc> 

C: 55.20(55.23, H= 5.61(5.48, N: 5.82(5.85). 
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EXAMPLE 2 

N- ( 2 - Boc - aminoe thy 1 ) glyc ine ( 2 ) 

The title compound was prepared by a modification of 
the procedure by Heimer, et al. Int. J. Pept., 1984, 23, 203- 
5 211 N- (2' -Aminoethyl) glycine (3.00 g; 25.4 mmol) was dissolved 
in water (50 ml) , dioxane (50 ml) was added, and the pH was 
adjusted to 11.2 with 2 N sodium hydroxide. tert-Butyl-4- 
nitrophenyl carbonate (1, 7.29 g; 30.5 mmol) was dissolved in 
dioxane (40 ml) and added dropwise over a period of 2 h, during 

10 which time the pH was maintained at 11.2 with 2 N sodium 
hydroxide. The pH was adjusted periodically to 11.2 for three 
more hours and then the solution was left overnight. The 
solution was cooled to 0°C and the pH was carefully adjusted 
to 3.5 with 0.5 M hydrochloric acid. The aqueous solution was 

15 washed with chloroform (3 x 200 ml), the pH adjusted to 9.5 
with 2N sodium hydroxide and the solution was evaporated to 
dryness, in vacuo (14 mmHg) . The residue was extracted with 
DMF (25+2x10 ml) and the extracts filtered to remove excess 
salt. This results in a solution of the title compound in 

20 about 60% yield and greater than 95% purity by tic (system 1 
and visualised with ninhydrin, Rf=0.3) . The solution was used 
in the following preparations of Boc-aeg derivates without 
further purification. 

EXAMPLE 3 
25 N-l-Carboxymethyl thymine (3) 

This procedure is different from the literature 
synthesis, but is easier, gives higher yields, and leaves no 
unreacted thymine in the product. To a suspension of thymine 
(4 0.0 g; 0.317 mol) and potassium carbonate (87.7 g; 0.634 

3 0 mmol) in DMF (900 ml) was added methyl bromoacetate (30.00 ml; 
0.317 mmol). The mixture was stirred vigorously overnight 
under nitrogen. The mixture was filtered and evaporated to 
dryness, in vacuo. The solid residue was treated with water 
(300 ml) and 4 N hydrochloric acid (12 ml), stirred for 15 min 

35 at 0°C, filtered, and washed with water (2 x 75 ml) . The 
precipitate was treated with water (120 ml) and 2N sodium 
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hydroxide (60 ml), and was boiled for 10 minutes. The mixture 
was cooled to 0°C, filtered, and the pure title compound was 
precipitated by the addition of 4 N hydrochloric acid (70 ml) . 
Yield after drying, in vacuo over sicapent: 37.1 g (64%) . 1 H- 
5 NMR: (90 MHz; DMSO-d*) : 11.33 ppm (s,lH,NH); 
7.49{d, J=0.92Hz,lH # ArIJ) ; 4,38 (s^H,^); 1.76 (d, J«0 . 92Hz , T- 
CH 3 ) 

EXAMPLE 4 

N-l-Carboxyme thy 1 thymine pentaf luorophenyl ester (4} 

10 N-l-Carboxymethyl thymine (3, 10. 0g; 54.3 mmol) and 

pentaf luorophenol (10,0 g; 54.3 mmol) were dissolved in DMF 
(100 ml) and cooled to 5°C in ice water. DCC (13.45 g; 65.2 
mmol) then was added. When the temperature passed below 5°C, 
the ice bath was removed and the mixture was stirred for 3 h 

15 at ambient temperature. The precipitated DCU was removed by 
filtration and washed twice with DMF (2 x 10 ml) . The combined 
filtrate was poured into ether (1400 ml) and cooled to 0°C. 
Petroleum ether (1400 ml) was added and the mixture was left 
overnight. The title compound was isolated by filtration and 

20 was washed thoroughly with petroleum ether. Yield: 14.8 
g(78%). The product was pure enough to carry out the next 
reaction, but an analytical sample was obtained by 
recrystallization from 2~propanol. M.p. 200.5-206°C Anal, for 
C :? H 7 F 5 N 2 0 4 . Found (calc. ) C: 44.79(44.59); H; 2.14(2*01) N: 

25 8.13(8.00). FAB-MS: 443 (M+l+glycerol ) , 351 (M+l) . 1 H-NMR (90 
MHz; DMS0-d 6 ) : 11.52 ppm (s,lH,NH); 7.64 (s,lH,ArH); 4.99 
(s,2H, Ofc) ; 1.76 (s,3H,CH 3 ). 

EXAMPLE 5 

1- (Boc-aeg) thymine (5) 

30 To the DMF-solution from Example 2 was added triethyl 

amine (7.08 ml; 50.8 mmol) followed by N-l-carboxymethyl thymine 
pentaf luorophenyl ester (4, 4.45 g; 12.7 mmol). The resultant 
solution was stirred for 1 h. The solution was cooled to 0°C 
and treated with cation exchange material ("Dowex SOW X-8", 40 

35 g) for 20 min. The cation exchange material was removed by 
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filtration, washed with dichlorome thane (2 x 15 ml) # and 
dichloromethane (150 ml) was added. The resulting solution was 
washed with saturated sodium chloride, dried over magnesium 
sulfate, and evaporated to dryness, in vacuo, first by a water 
5 aspirator and then by an oil pump. The residue was shaken with 
water (50 ml) and evaporated to dryness. This procedure was 
repeated once. The residue then was dissolved in methanol (75 
ml) and poured into ether (600 ml) and petroleum ether (1.4 L) . 
After stirring overnight, the white solid was isolated by 
10 filtration and was washed with petroleum ether. Drying over 
sicapent, in vacuo, gave 3.50 g (71.7%). M.p. 142-147°C. Anal. 

for C u H 24 N 4 0 7 . Found(calc) C: 49.59(50.00) H: .6.34(6.29) N; 

14.58(14.58). l H-NMR (250 MHz, DMS0-d 6 ) : Due to the limited 
rotation around the secondary amide bond several of the signals 

15 were doubled in the ratio 2:1, (indicated in the list by m j . for 
major and mi. for minor). 12.73 ppm (b,lH, -C0 2 H) ; 11.27 ppm 
(s, m j . , imide) ; 11.25 ppm (s, mi., imide) ; 7.30 ppm (s, m j . , 
ArH) ; 7.26 ppm (s, mi., ArH) ; 6.92 ppm (unres. t, mj . , BocNH) ; 
6.73 ppm (unres. t; mi., BocNH); 4.64 ppm (s, m j . , T-CH 2 -C0-); 

20 4.47 ppm (s, mi., T-CH 2 -CO-); 4.19 ppm (s, mi., C0NRCg 2 C0 2 H) ; 
3.97 ppm { s , m j . , C0NRCH 2 C0 2 H) ; 3.41-2.89 ppm {unres . m, - 
CH 2 CH 2 - and water) ; 1.75 ppm (s,3H, T-CH 3 ) ; 1.38 ppm (s, 9H, t- 
Bu) . 13 C-NMR: 170.68 ppm (CO); 170.34 (CO); 167.47 (CO); 167.08 
(CO); 164.29 (CO); 150.9 (C5"); 141.92 (C6"); 108.04 (C2'); 

25 77.95 and 77.68 (Thy-C&CO) ; 48.96, 47.45 and 46.70 (-CH 3 CH 3 - 
and NCH 2 C0 2 H) ; 37.98 (Thy-CH 5 ) ; 28.07 (t-Bu) . FAB-MS: 407 
(M+Na*) ; 385 (M+H*) . 

EXAMPLE 6 

1- (Boc-aeg) thymine pen tafluorophenyl ester (6, Boc-Taeg . OPfp) 

30 1- (Boc-aeg) thymine (5) (2.00 g; 5.20 mmol) was 

dissolved in DMF (5 ml) and methylene chloride {15 ml) was 
added. Pentaf luorophenol (1.05 g; 5.72 mmol) was added and the 
solution was cooled to 0°C in an ice bath. DDC then was added 
(1.29 g; 6.24 mmol) and the ice bath was removed after 2 min. 

3 5 After 3 h with stirring at ambient temperature, the 
precipitated DCU was removed by filtration and washed with 
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methylene chloride. The combined filtrate was washed twice 
wi-h aqueous sodium hydrogen carbonate and once with saturated 
sodium chloride, dried over magnesium sulfate, and evaporated 
to dryness, in vacuo. The solid residue was dissolved in 

5 dioxane (150 ml) and poured into water (200 ml) at 0«C. The 
title compound was isolated by filtration, washed with water, 
and dried over sicapent, in vacuo. Yield: 2.20 g (77%). An 
analytical sample was obtained by recrystallisation from 2- 
propanol. M.p. 174-175.5-C. Analysis for C„H 23 N 4 0 7 F 5 , found <- 

L0 calc.): C: 48.22(48.01); H: 4.64(4.21); N: 9.67(10.18). H-NMR 

(2-0 MHz, CDCl 3 ):Due to the limited rotation around the 
secondary amide bond several of the signals were doubled in the 
ra-io 6:1 (indicated in the list by mj . for major and ma. for 
minor). 7.01 ppm (s, mi., ArH) ; 6.99 ppm (s, mj . , ArH) ; 5.27 
15 pp. (unres. t, BocNH) ; 4.67 ppm (s, m j . , T-CH.-C0-) ; 4.60 ppm 
(s mi., T-CH 2 -CO-); 4.45 ppm (s, mj . , CONRC&CO^fp) ; 4.42 ppm 
(s', mi., CDHRCH.CQ.Pfp) ; 3.64 ppm <t , 2H,BocNHCH 2 CH 2 -) ; 3.87 ppm 
(-q-^H.BocHHCfcCH,-), 1 . 44 (s, 9H, t-Bu) . FAB-MS: 551 (10; M + l) ; 
45= (10; M+l-tBu) ; 451 (80; -Boc) . 

20 EXAMPLE 7 

N-Benzyloxycarbonyl-N-Mbocaminoethyl) glycine (7) 

Aminoethyl glycine (52.86 g; 0.447 mol) was dissolved 
ir water (900 ml) and dioxane (900 ml) was added. The P H was 
adjusted to 11.2 with 2N NaOH. While the P H was kept at 11.2, 

25 te-t-butyl-p-nitrophenyl carbonate (128.4 g; 0.537 mol) was 
dissolved in dioxane (720 ml) and added dropwise over the 
cc-Be of 2 hours. The P H was kept at 11.2 for at least three 
mc^e hours and then left with stirring overnight. The yellow 
sc-ution was cooled to 0°C and the pH was adjusted to 3.5 with 

30 2 N HC1. The mixture was washed with chloroform (4x100 ml), 
a-* the pH of the aqueous phase was readjusted to 9.5 with 2 
N NaOH at 0°C. Benzyloxycarbonyl chloride (73.5 ml; 0.515 mol) 
was added over half an hour, while the pH was kept at 9.5 with 
2 %• NaOH . The pH was adjusted frequently over the next 4 

35 hours, and the solution was left with stirring overnight. On 
t- following day the solution was washed with ether (3x600 ml) 
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and the pH of the solution was afterwards adjusted to 1.5 with 
2 N HC1 at 0°C. The title compound was isolated by extraction 
with ethyl acetate (5x1000 ml) . The ethyl acetate solution was 
dried over magnesium sulfate and evaporated to dryness, in 
5 vacuo. This afforded 138 g, which was dissolved in ether (300 
ml) and precipitated by the addition of petroleum ether (1800 
ml). Yield 124.7 g (79%). M.p. 64.5-85 °C. Anal, for C a7 H a ,NA 
found(calc) C: 58-40(57.94); H: 7.02 (6.86); N: 7.94(7.95). 
l H-NMR (250 MHz, CDC1 3 ) 7.33 & 7.32 (5H, Ph) ; 5,15 & 5.12 (2H, 
10 PhCHj) ; 4.03 & 4.01 (2H, NC&COaH) ; 3.46 (b, 2H, BocNHCHjCHj) ; 
3.28 (b, 2H, BocNHCiLCHJ ; 1.43 & 1,40 (9H, r Bu) . HPLC (260 nm) 
2C,7lmin. (80.2%) and 21.57 min. (19.8%). The UV- spectra (200 
nm - 300 nm) are identical, indicating that the minor peak 
consists of Bis-Z-AEG. 

15 EXAMPLE 8 

N' -Boc- amino ethyl glycine ethyl ester (8) 

N-Benzyloxycarbonyl-N' - (bocaminoethyl) glycine (7 , 
60.0 g; 0.170 mol) and N, N-dimethyl-4-aminopyridine (6.00 g) 
were dissolved in absolute ethanol (500 ml) , and cooled to 0°C 

20 before the addition of DCC (42.2 g; 0.204 mol). The ice bath 
was removed after 5 minutes and stirring was continued for 2 
more hours. The precipitated DCU (32.5 g dried) was removed 
by filtration and washed with ether (3x100 ml) . The combined 
filtrate was washed successively with diluted potassium 

25 hydrogen sulfate (2x4 00 ml) , diluted sodium hydrogencarbonate 
(2x4 00 ml) and saturated sodium chloride (1x400 ml) . The 
organic phase was filtered, then dried over magnesium sulfate, 
and evaporated to dryness, in vacuo, which yielded 66.1 g of 
ar. oily substance which contained some DCU. 

30 The oil was dissolved in absolute ethanol (600 ml) 

and was added 10% palladium on carbon (6.6 g) was added. The 
solution was hydrogenated at atmospheric pressure, where the 
reservoir was filled with 2 N sodium hydroxide. After 4 hours, 
3.3 L was consumed out of the theoretical 4.2 L. The reaction 

3 5 mixture was filtered through celite and evaporated to dryness, 
in vacuo, affording 39.5 g (94%) of an oily substance. A 13 
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g portion of the oily substance was purified by silica gel (600 
g SiO,) chromatography • After elution with 300 ml 20% 
petroleum ether in methylene chloride, the title compound was 
eluted with 1700 ml of 5% methanol in methylene chloride. The 
5 solvent was removed from the fractions with satisfactory 
purity, in vacuo and the yield was 8.49 g. Alternatively 10 
g of the crude material was purified by Kugel Rohr 
distillation. l H-NMR (250 MHz, CD 3 0D); 4.77 (b. s, NH) ; 4.18 
(q, 2H, MeQH,-); 3.38 (s, 2H, NC&CO^Et) ; 3.16 (t, 2H, 
10 BocNHCHjCH 2 ) ; 2.68 (t, 2H, BocNHC^CH,) ; 1,43 (s, 9H, <Bu) and 
1.26 (t, 3H, CH,) "C-NMR 171.4 (COEt) ; 156.6 (CO); 78.3 
((CHJjC); 59.9 (CH 2 ) ; 49.0 (CH 3 ) ; 48.1 (CH 2 ) ; 39.0 (CH 2 ); 26.9 

(CH a ) and 12.6 <CH 3 ) . 

EXAMPLE 9 

15 N' -Boc- amino ethyl glycine methyl ester (9) 

The above procedure was used, with methanol being 
substituted for ethanol. The final product was purified by 
flash column chromatography. 

EXAMPLE 10 

20 1- (Boc -aeg) thymine ethyl ester (10) 

N' -Boc-aminoethyl glycine ethyl ester (8, 13.5 g; 
54.8 mmol) , DhbtOH {9.84 g; 60.3 mmol) and N-l-carboxymethyl 
thymine (4, 11.1 g; 60.3 mmol) were dissolved in DMF (210 ml) . 
Methylene chloride (210 ml) then was added. The solution was 

25 cooled to 0°C in an ethanol/ice bath and DCC (13.6 g; 65.8 
■ mmol) was added. The ice bath was removed after 1 hour and 
stirring was continued for another 2 hours at ambient 
temperature. The precipitated DCU was removed by filtration 
and washed twice with methylene chloride (2 x 75 ml) . To the 

3 0 combined filtrate was added more methylene chloride (650 ml) , 
The solution was washed successively with diluted sodium 
hydrogen carbonate (3 x 500 ml) , diluted potassium hydrogen 
sulfate (2 x 500 ml) , and saturated sodium chloride (1 x 500 
ml) . Some precipitate was removed from the organic phase by 

35 filtration, The organic phase was dried over magnesium sulfate 
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and evaporated to dryness, in vacuo. The oily residue was 
dissolved in methylene chloride (150 ml), filtered, and the 
title compound was precipitated by the addition of petroleum 
ether (350 ml) at 0°C. The methylene chloride/petroleum ether 
5 procedure was repeated once. This afforded 16.0 g (71%) of the 
title compound which was more than 99% pure by HPLC. 

EXAMPLE 11 

1- (Boc-aeg) thymine (6a) 

10 The material from Example 10 was suspended in THF 

(194 ml, gives a 0.2 M solution), and 1 M aqueous lithium 
hydroxide (116 ml) was added. The mixture was stirred for 45 
minutes at ambient temperature and then filtered to remove 
residual DCU. Water (40 ml) was added to the solution which 

15 was then washed with methylene chloride (300 ml) . Additional 
water (30 ml) was added, and the alkaline solution was washed 
once more with methylene chloride (150 ml) . The aqueous 
solution was cooled to 0°C and the pH was adjusted to 2 by the 
dropwise addition of 1 N HC1 (approx. 110 ml) . The title 

20 compound was extracted with ethyl acetate (9 x 200 ml), the 
combined extracts were dried over magnesium sulfate and were 
evaporated to dryness, in vacuo. The residue was evaporated 
once from methanol, which after drying overnight afforded a 
colorless glassy solid. Yield 9.57 g (64 %) . HPLC > 98% 

25 R--14.8 min . Anal, for C 16 H 2 «N«O 7 o0 . 25 H 2 0 Found (calc.) C: 
49.29(49.42); H: 6.52(6.35); N: 14.11(14.41). Due to the 
limited rotation around the secondary amide, several of the 
signals were doubled in the ratio 2:1 (indicated in the list 
by mj. for major and mi. for minor). 'H-NMR (250 MHz, DMSO- 

30 d £ ): 12.75 (b.s., 1H, C0 2 H) ; 11.28 (s, "1H", mj . , imide NH) ; 
11.26 (s, "1H", mi., imide NH) ; 7.30 <s, «1H", mj . , T H-6); 
7.26 (s, "1H", mi., T H-6) ; 6.92 (b.t., M 1H'\ m j . , BocNH); 6.73 
(b.t., "1H", mi., BocNH); 4.64 (s, "2H", m j . , CH 2 CON) ; 4.46 
(s, M 2H»\ mj., CH 2 C0N); 4.19 (s, »2H», mi., CH 2 C0 2 H) ; 3.97 (s, 
35 "2H", mj . , QkCOjH) ; 3.63-3.01 (unresolved m, includes water, 
CHjCHj); 1.75 (s, 3H, CH 3 ) and 1.38 (s, 9H, *Bu) . 
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EXAMPLE 12 

N- 4 -Benzyloxycarbonyl cytosin (12) 

Over a period of about 1 h, benzyloxycarbonyl 
chloride (52 ml; 0.36 mol) was added dropwise to a suspension 
5 of cytosine (8, 20.0 g;0.18 mol) in dry pyridine (1000 ml) at 
0°C under nitrogen in oven-dried equipment. The solution then 
was stirred overnight, after which the pyridine suspension was 
evaporated to dryness, in vacuo. Water (200 ml) and 4 N 
hydrochloric acid were added to reach pH -1, The resulting 

10 white precipitate was filtered off, washed with water and 
partially dried by air suction. The still -wet precipitate was 
boiled with absolute ethanol (500 ml) for 10 min, cooled to 
0°C, filtered, washed thoroughly with ether, and dried, in 
vacuo. Yield 24.7 g (54%). M.p.>250°C. Anal, for C^H^NjOj. 

15 Found(calc) ; C: 58.59(58,77); H: 4.55(4.52); N: 17.17(17.13). 

No NMR spectra were recorded since it was not possible to get 
the product dissolved. 

EXAMPLE 13 

N- 4 - Benzyloxycarbonyl -N- 1 - carboxyme thyl cytosine (13) 

20 In a three necked round bottomed flask equipped with 

mechanical stirring and nitrogen coverage was placed methyl 
bromacetate (7.82 ml; 82. 6 mmol) and a suspension of N-4- 
benzyloxycarbonyl cytosine (12, 2l.0g;82.6 mmol) and potassium 
carbonate (11.4 g;82.6 mmol) in dry DMF (900 ml) . The mixture 

25 was stirred vigorously overnight, filtered, and evaporated to 
dryness, in vacuo. Water (300 ml) and 4 N hydrochloric acid 
- (10 ml) were added, the mixture was stirred for 15 minutes at 
0°C, filtered, and washed with water (2 x 75 ml) . The isolated 
precipitate was treated with water (120 ml) , 2N sodium 

30 hydroxide (60 ml) , stirred for 30 min, filtered, cooled to 0°C, 
and 4 N hydrochloric acid (35 ml) was added. The title 
compound was isolated by filtration, washed thoroughly with 
water, recrystallized from methanol (1000 ml) and washed 
thoroughly with ether. This afforded 7.70 g (31%) of pure 

3 5 compound. The mother liquor from the recrystallization was 
reduced to a volume of 200 ml and cooled to 0°C. This afforded 
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an additional 2.30 g of a material that was pure by tic but had 
a reddish color. M.p. 266-274 6 C. Anal, for C 14 H 13 N 3 O s , 
Found(calc) ; C: 55.41(55.45); H: 4.23(4.32); N: 14.04(13.86). 
X H-NMR (90 MHz; DMSO-d € ) : 8.02 ppm (d, J=7 . 32Hz , 1H # H-6) ; 7.39 
5 (s,5H,Ph); 7.01 (d, J=7 . 32Hz , 1H , H-5) ; 5.19 (s , 2H, PhCH 2 - ) ; 4.52 
(s,2H) . 

EXAMPLE 14 

N-4-Benzyloxycarbonyl -N-l-carboxymethyl cytosine penta- 
fluorophenyl ester (14) 

10 N-4-Benzyloxycarbonyl-N-l-carboxymethyl -cytosine (13, 

4.00 g; 13.2 mmol) and pentaf luorophenol (2.67 g; 14.5 mmol) 
were mixed with DMF (70 ml) , cooled to 0°C with ice-water, and 
DCC (3.27 g; 15.8 mmol) was added. The ice bath was removed 
after 3 min and the mixture was stirred for 3 h at room 

15 temperature. The precipitated DCU was removed by filtration, 
washed with DMF, and the filtrate was evaporated to dryness, 
ir vacuo (0.2 mmHg) . The solid residue was treated with 
methylene chloride (250 ml), stirred vigorously for 15 min, 
filtered, washed twice with diluted sodium hydrogen carbonate 

20 and once with saturated sodium chloride, dried over magnesium 
sulfate, and evaporated to dryness, in vacuo. The solid 
residue was recrystallized from 2-propanol (150 ml) and the 
crystals were washed thoroughly with ether. Yield 3.40 g 
(55%). M.p. 241-245°C. Anal, f or C 20 H 12 N 3 F 5 O 5 . Found (calc .) ; C: 

25 51.56(51.18); H: 2.77(2.58); N: 9.24 (8 .95) ^H-NMR (90 MHz ; 
CDC1 3 ) : 7.66 ppm <d, J=7.63Hz,lH,H-6) ; 7.37 (s,5H,Ph); 7.31 
(d, J=7.63Hz,lH,H-5) ; 5.21 (s , 2H, PhCH 2 - ) ; 4.97 (s,2H,NCH 2 -) . 
FA3-MS: 470 (M+l) 

EXAMPLE 15 

30 N- 4 -Benzyloxycarbonyl-l-Boc-aeg- cytosine (15) 

To a solution of (N-Boc-2-aminoethyl) glycine 2, in 
DM", prepared as described in Example 2, was added triethyl 
amine (7.00 ml; 50.8 mmol) and N-4-benzyloxycarbonyl-N-l- 
carboxymethyl -cytosine pentaf luorophenyl ester (14, 2.70 g; 
35 5.75 mmol). After stirring the solution for 1 h at room 
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10 



15 



20 



teimoerature, methylene chloride (150 ml), saturated sodium 
chloride (250 ml) , and 4 N hydrochloric acid to pH -1 were 
added. The organic layer was separated and washed twice wxth 
saturated sodium chloride, dried over magnesium sulfate, and 
evaporated to dryness, in vacuo, first with a water aspirator 
and then with an oil pump. The oily residue was treated wxth 
water (25 ml) and was again evaporated to dryness, in vacuo. 
Th<s procedure then was repeated. The oily residue (2.80 g) 
was then dissolved in methylene chloride (100 ml), petroleum 
ether (250 ml) was added, and the mixture was stirred 
overnight. The title compound was isolated by filtration and 
washed with petroleum ether. Tic (system 1) indicated 
substantial quantities of pentaf luorophenol , but no attempt was 
made to remove it. Yield: 1.72 g (59%). M.p. 156°C (decomp . ) . 
iH-NMR (250 MHz, CDC1 3 ) : Due to the limited rotation around the 
secondary amide bond several of the signals were doubled in the 
ratio 2:1, (indicated in the list by mj . for major and mi. for 
minor). 7.88 ppm (dd.lH.H-6); 7.39 (m,5H,Ph); 7.00 (dd.lH.H-5) , 
6.92 (b,lH,BocNH); 6.74 ( b , 1H , ZNH) - ? ; 5.19 (s , 2H, Ph-CHj) ; 4.81 
ppm (s, mj., Cyt-CH 2 -CO-); 4.62 ppm (s, mi., Cyt-CH 2 -CO-) ; 4.23 
(s mi., CONRC&CO.H) ; 3.98 ppm (s, mj., CONRCfi 2 C0 3 H) ; 3.42-3.02 
(unres. m, -CH 2 CH 2 - and water);!. 37 (s.9H,tBu). FAB-MS: 504 
(K+l) ; 448 (M+l-tBu) . 



25 



30 



35 



EXAMPLE 16 

N-4- Benzyloxycarbonyl - 1 - Hoc -aeg- cy tosine pentaf luor ophenyl 
ester (16) 

N-4-Benzyloxycarbonyl-l-Boc-aeg-cytosine (15, 1.50 
g . 2 98 mmol) and pentaf luorophenol (548 mg ; 2.98 mmol) was 
dissolved in DMF (10 ml) Methylene chloride (10 ml) was added, 
tJ- reaction mixture was cooled to 0°C in an ice bath, and DCC 
(676 mg; 3.28 mmol) was added. The ice bath was removed after 
3 min and the mixture was stirred for 3 h at ambient 
ternoerature . The precipitate was isolated by filtration and 
washed once with methylene chloride. The precipitate was 
dissolved in boiling dioxane (150 ml) and the solution was 
cooled to 15-C, whereby DOT precipitated. The DOT was removed 
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by filtration and the resulting filtrate was poured into water 
(250 ml) at 0°C. The title compound was isolated by 
filtration, was washed with water, and dried over sicapent, in 
vacuo. Yield 1.30 g (65%) . Analysis for C 29 H ae N s O B F 5 . 
5 Found(calc) ; C: 52.63(52.02); H: 4.41(4.22); N: 10.55(10.46). 
1 H-NMR (250 MHz ; DMSO-dg) : showed essentially the spectrum of 
the above acid, most probably due to hydrolysis of the ester. 
FAB-MS: 670 (M+l) ; 614 (M+l-tBu) 



EXAMPLE 17 

10 N-4-Benzyloxycarbonyl-l- (Boc-aeg) cytosine (17) 

N' -Boc-aminoethyl glycine ethyl ester (8, 5.00 g; 
20.3 mmol), DhbtOH (3.64 g; 22.3 mmol) and N-4-benzyloxy- 
carbonyl-N-l-carboxymethyl cytosine (13, 6.77 g; 22.3 mmol) 
were suspended in DMF (100 ml) . Methylene chloride (100 ml) 

15 then was added. The solution was cooled to 0°C and DCC (5.03 
g; 24.4 mmol) was added. The ice bath was removed after 2 h 
and stirring was continued for another hour at ambient 
temperature. The reaction mixture then was evaporated to 
dryness, in vacuo. The residue was suspended in ether (100 ml) 

20 and stirred vigorously for 30 min. The solid material was 
isolated by filtration and the ether wash procedure was 
repeated twice. The material was then stirred vigorously for 
15 min with dilute sodium hydrogencarbonate (aprox. 4% 
solution, 100 ml), filtered and washed with water. This 

25 procedure was then repeated once, which after drying left 17.0 
g of yellowish solid material. The solid was then boiled with 
dioxane (200 ml) and filtered while hot. After cooling, water 
(200 ml) was added. The precipitated material was isolated by 
filtration, washed with water, and dried. According to HPLC 

3 0 (observing at 260 nm) this material has a purity higher than 
99%, besides the DCU. The ester was then suspended in THF (100 
ml), cooled to 0°C, and 1 N LiOH (61 ml) was added. After 
stirring for 15 minutes, the mixture was filtered and the 
filtrate was washed with methylene chloride (2 x 150 ml) . The 

35 alkaline solution then was cooled to 0°C and the pH was 
adjusted to 2 . 0 with 1 N HC1 . The title compound was isolated 
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by filtration and was washed once with water, leaving 11.3 g 
of a white powder after drying . The material was suspended in 
methylene chloride (300 ml) and petroleum ether (300 ml) was 
added. Filtration and wash afforded 7,1 g (69%) after drying. 
5 HPLC showed a purity of 99% Rr= 19,5 min, and a minor impurity 
at 12.6 min (approx. 1%) most likely the Z-de protected 
monomer. Anal, for C„H 29 N 5 0 8 found (calc.) C; 54.16(54.87); H: 
5.76(5.81) and N: 13.65(13.91). *H-NMR (250 MHz, DMSO-d 6 ) . 
10.78 (b.s, 1H, COjH) ; 7.88 (2 overlapping dublets, 1H, Cyt H- 

10 5); 7,41-7,32 (m, 5H, Ph) ; 7.01 (2 overlapping doublets, 1H, 
Cyt H-6) ; 6.94 & 6.78 (unres. triplets, 1H, BocNH) ; 5.19 (s, 
2K, PhCHj) ; 4.81 & 4.62 (s, 2H, QLCON) ; 4.17 & 3,93 ( s , 2H f 
CH ; C0,H) ; 3.42-3.03 (m, includes water, C&CHj) and 1.38 & 1.37 
(s, 9H, *Bu) . "C-NMR. 150.88; 128.52; 128.18; 127.96; 93.90; 

15 66.53; 49.58 and 28.22. IR: Frequency in cm" 1 (intensity). 
3423 (26.4), 3035 (53.2), 2978(41.4), 1736(17.3), 1658(3.8), 
1563(23.0), 1501(6.8) and 1456 (26.4). 

EXAMPLE 18 

9-Carboxymethyl adenine ethyl eater (18) 

20 Adenine (10.0 g, 74 mmol) and potassium carbonate 

(10.29 g, 74.0 mmol) were suspended in DMF and ethyl 
bromoacetate (8.24 ml, 74 mmol) was added. The suspension was 
stirred for 2.5 h under nitrogen at room temperature and then 
filtered. The solid residue was washed three times with DMF 

25 (10 ml) . The combined filtrate was evaporated to dryness, in 
vacuo. The yellow-orange solid material was poured into water 
(200 ml) and 4 N HCl was added to pH~6. After stirring at 0°C 
for 10 min, the solid was filtered off, washed with water, and 
recrystallized from 96% ethanol (150 ml) . The title compound 

30 was isolated by filtration and washed thoroughly with ether. 
Yield 3.4 g (20%). M.p. 215.5-220°C. Anal, for C 9 H u N 5 0 2 
found(calc. ) : C: 48.86 (48.65); H: 5.01(4.91); N: 31.66(31.42). 
^-NMR (250 MHz ; DMSO-dJ : (s, 2H f H-2 & H-8) , 7.25 (b. s. # 2H, 
NH 2 ) , 5.06 (s, 2H, NCH 2 ) , 4.17 (q, 2H, J=7.11 Hz, OCH 2 ) and 1.21 

35 (t, 3H, J=7.13 Hz, NCH 2 ) . 13 C-NMR. 152.70, 141.30, 61.41, 43.97 
and 14.07. FAB-MS. 222 (MH+) . IR; Frequency in cm" 1 
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(intensity). 3855 (54.3), 3274(10.4), 3246(14.0), 3117(5.3), 
2989(22.3), 2940(33.9), 2876(43.4), 2753(49.0), 2346(56.1), 
2106 (57.1) , 1899 (55.7) , 1762 (14.2) , 1742 (14.2) , 1742 (1.0) , 
1671(1.8) , 1644(10.9) , 1606(0.6) , 1582(7.1) , 1522(43.8) , 
5 1477(7.2), 1445(35.8) and 1422(8.6). The position of 
alkylation was verified by X-ray crystallography on crystals, 
which were obtained by recrystallization from 96% ethanol. 

Alternatively, 9-carboxymethyl adenine ethyl ester 
18, can be prepared by the following procedure. To a 

10 suspension of adenine (50.0 g, 0.37 mol) in DMF (1100 ml) in 
2 L three -necked flask equipped with a nitrogen inlet, a 
mechanical stirrer and a dropping funnel was added 16.4 g 
(0.407 mol) haxane washed sodium hydride- mineral oil 
dispersion. The mixture was stirred vigorously for 2 hours, 

15 then ethyl bromacetate 75 ml, 0.67 mol) was added dropwise over 
the course of 3 hours. The mixture was stirred for one 
additional hour, whereafter tic indicated complete conversion 
of adenine. The mixture was evaporated to dryness at 1 mmHg 
and water (500 ml) was added to the oily residue which caused 

20 crystallization of the title compound. the solid was 
recrystallized from 06% ethanol (600 ml) . Yield after drying 
53.7 (65.6%). HPLC (215 nm) purity > 99.5%. 

EXAMPLE 19 

N- 6 -Benzyloxycarbonyl- 9-carboxymethyl adenine ethyl eeter (19) 

25 9-Carboxymethyladenine ethyl ester (18, 3.40 g, 15.4 

mmcl) was dissolved in dry DMF (50 ml) by gentle heating, 
ccoled to 20°C, and added to a solution of N-ethyl-benzyloxy- 
carbonyl imidazole tetraf luoroborate (62 mmol) in methylene 
chloride (50 ml) over a period of 15 min with ice-cooling. 

30 Sc-e precipitation was observed. The ice bath was removed and 
the solution was stirred overnight. The reaction mixture was 
treated with saturated sodium hydrogen carbonate (100 ml) . 
Afcer stirring for 10 min, the phases were separated and the 
organic phase was washed successively with one volume of water, 

35 dilute potassium hydrogen sulfate (twice), and with saturated 
scdium chloride. The solution was dried over magnesium sulfate 
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15 



. a m drvness in vacuo, which afforded 11 g of an 
and evaporated to dryness, i« „««->, v iene 
\ The material was dissolved in methylene 

oily material. Tne matexxa . with 

(25 ml), cooled to 0<>C, and precipitated with 
chloride (25 ml) , procedure was repeated once to 

petroleum ether (50 ml) . ™is P r -,32-35°C. 

• . 45 a (63%) of the title compound. M.p. 132 35 c. 
gi ve 3 45 g ^ fQund (calc>): C: 56.95(57.46); H: 

Analysis for c 17 h 17 w 5 u 4 i.^^ . Q nn / G 

^IM.M), ■ ■ 19.35(19.71). 'H-NMR (250 MHZ,- CDC1 > = .77 ,s. 
1H H-2 or H-8); 7.99 (s. 1H. H-2 or H-9) , 7.45-7.26 m. SB. 
1H, H 2 or n I. ph-CH,); 4-27 (9. 2H. 

p h!; 5.31 (». 2H ■-<»> • *•» ^ M , 15 „, CHjCHj) . "C-KMR: 
a.,.15 Hz. C&CH, and 1.3« t 3H ^ ^ o9 FAB-MS: 

153.09, 143.11; 129 ' 66 ' "• 84 ; R 6 . 2 / re ' guencyinOT -' (intensity,. 

" - 1 - - ' - 3031 - ■ 91 - 2981 :r» ; 
I.™..., - 52(25 - 2,; i5ii,45 - 2,: 

1492(37.9); 1465(14.0) and 1413(37.3). 

EXAMPLE 20 . 
N-S-Benzyloxycarbonyl^-carboxymethyl adenine 20) 

N^-Benzyloxycarbonyl-9-carboxymethylademne ethy 
r.r (19 3 20 g; 9.01 mmol) was mixed with methanol (50 ml) 
6 Tied to Cc Sodium Hydroxide Solution (50 ml; 2H) was 
Td d whereby the material guicKly dissolved. After 30 mm 
af oic the alKaline solution was washed with methylene 
c -oride (2x50ml) . The agueous solution was brought to pH 1.0 
CL ' at o-c. whereby the title compound precipitated, 
with N HCl at o t-, w drying was 



25 



With 4 N ntl at u ^> J , no was 

I 4 .id after filtration, washing with water, and drying was 

Tne yield after t .,, r ,^ qa i t and elemental 

•,.38 a (104%). The product untamed salt and 



3.08 9 (X0«1- x«« »~~- (calc): C: 

analysis reflected tha. Anal ^ ^ ^ 4Q) and C/N: 

/ic t-we;^ 05)- H: 4.24 (4.00), »• 

46- 32(55. us; , d H _ 

■> -7(7 56) >H-NMR(250 MHz; DMS0-d 6 ) s 8.70 (S, 

"« (m 5H Ph). 5.27 (s, 2H, N-C&) ; and 5.15 (s. 
J0 8;; 7.50-7.35 (m. 5H, Ph) , 

Ph-CH) "C-NMR. 168.77, 152.54, 151.36, 148. /b. 

2r., Pn-CfibJ • ^ (KBr) 3484 18.3) 

m 1-5-7 Qft 66.76 and 44.67.IR 

9 3087(15 o',; 2966,17.1,; 2927,19.9,; 2383(53.8, 

, 1739(2 5 1688,5.2, , 1655,0.9,; 1594 ,11.7, 

- „«.,«. , 173»«. ). „,„»..., "55(14.0, 

35 1560 ,12.3 ; 153 0 .2* 3). 1 ^ ^ (MH+ 
1429 (24.5) and 1411(23. • 
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C0 2 ) . HPLC (215 nm, 260 nm) in system 1: 15.18 min, minor 
impurities all less than 2%. 



EXAMPLE 21 

N-6-Benzyloxycarbonyl-l- (Boc-aeg) adenine ethyl ester (21) 

5 N' -Boc-aminoethyl glycine ethyl ester (8, 2.00 g; 

8.12 mmol), DhbtOH (1.46 g; 8.93 mmol) and N-6-benzyloxy- 
carbonyl-9-carboxymethyl adenine (20, 2.92 g; 8.93 mmol) were 
dissolved in DMF (15 ml) . Methylene chloride (15 ml) then was 
added. The solution was cooled to 0°C in an ethanol/ice bath. 

10 DCC (2.01 g; 9.74 mmol) was added. The ice bath was removed 
after 2.5 h and stirring was continued for another 1.5 hour at 
ambient temperature. The precipitated DCU was removed by 
filtration and washed once with DMF (15 ml), and twice with 
methylene chloride (2 x 15 ml) . To the combined filtrate was 

15 added more methylene chloride (100 ml) The solution was 
washed successively with dilute sodium hydrogen carbonate (2 

* 

x 100 ml) , dilute potassium hydrogen sulfate (2 x 100 ml) , and 
saturated sodium chloride (1 x 100 ml) . The organic phase was 
evaporated to dryness, in vacuo, which afforded 3.28 g (73%) 

20 of a yellowish oily substance. HPLC of the raw product showed 
a purity of only 66% with several impurities, both more and 
less polar than the main peak. The oil was dissolved in 
absolute ethanol (50 ml) and activated carbon was added. After 
stirring for 5 minutes, the solution was filtered. The 

25 filtrate was mixed with water (30 ml) and was left with 
stirring overnight. The next day, the white precipitate was 
removed by filtration, washed with water, and dried, affording 
1.16 g (26%) of a material with a purity higher than 98% by 
HPLC. Addition of water to the mother liquor afforded another 

30 0.53 g with a purity of approx. 95%. Anal, for C 26 H n N,0 7 oH 2 0 
found (calc.) C: 55.01(54.44; H: 6.85(6.15) and N: 16 .47 (17 . 09) . 
^-NMR (250 MHz, CDCl 3 ) 8.74 (s, 1H, Ade H-2) ; 8.18 (b. s f 1H, 
ZNH) ; 8.10 Ec 8.04 (s, 1H, H-8) ; 7.46-7.34 (m, 5H, Ph) ; 5.63 
(unres. t, 1H, BocNH) ; 5.30 (s, 2H, PhCH 2 ) ; 5.16 &5.00 <s, 2H, 

35 CH : CON) ; 4.29 & 4.06 (s, 2H, CE 2 C0 2 H) ; 4.20 (q f 2H, 0CH a CH 3 ) ; 
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3.67-3.29 (m, 4H, C&CIL) ; 1.42 (s, 9H, c Bu) and 1.27 (t, 3H, 
OCH 2 CHj) . The spectrum shows traces of ethanol and DCU. 



EXAMPLE 22 

N-6-Benzyloxycarbonyl-l- (Boc-aeg) adenine (22) 

5 N-6-Benzyloxycarbonyl-l- (Boc-aeg) adenine ethyl ester 

(21, 1.48 g; 2.66 mmol) was suspended in THF (13 ml) and the 
mixture was cooled to 0°C. Lithium hydroxide (8 ml; 1 N) was 
added. After 15 min of stirring, the reaction mixture was 
filtered, extra water (25 ml) was added, and the solution was 

10 washed with methylene chloride (2 x 25 ml) . The pH of the 
aqueous solution was adjusted to pH 2.0 with 1 N HCl . The 
precipitate was isolated by filtration, washed with water, and 
dried, affording 0-82 g (58%). The product reprecipitated 
twice with methylene chloride/ petroleum ether, 0.77 g (55%) 

15 after drying. M.p. 119°C (decomp.) Anal, for C 24 H a9 N,0 7 °H 2 0 

found(calc) C: 53.32(52.84); H: 5.71(5.73); N: 17.68(17.97). 
FAB-MS . 528.5 (MH+). l H-NMR (250 MHz, DMSO-dJ . 12.75 (very 
b, 1H, C0 2 H) ; 10.65 (b. s, 1H, ZNH) ; 8.59 (d, 1H, J= 2.14 Hz, 
Ade H-2); 8.31 (s, 1H, Ade H-8) ; 7.49-7.31 (m ( 5H, Ph) ; 7.03 
2 0 & 6.75 (unresol. t, 1H, BocNH) ; 5.33 & 5.16 (s, 2H # CH 2 CON) ; 
5.22 (s, 2H, PhCiL) ; 4.34-3.99 (s, 2H, CH 2 CO,H) ; 3.54-3.03 (m's, 
includes water, C&CH,) and 1.39 & 1.37 (s, 9H, c Bu) . 13 C-NMR. 
170.4; 166.6; 152.3; 151.5; 149.5; 145.2; 128.5; 128.0; 127.9; 
66.32; 47.63; 47.03; 43.87 and 28.24. 



25 EXAMPLE 23 

2-Amino-6-chloro-9-carboxymethylpurine (23) 

To a suspension of 2-amino-6-chloropurine (5.02 g; 

29.6 mmol) and potassium carbonate (12.91 g; 93.5 mmol) in DMF 

(50 ml) was added bromoacetic acid (4.70 g; 22.8 mmol). The 

3 0 mixture was stirred vigorously for 20 h. under nitrogen. Water 

(150 ml) was added and the solution was filtered through Celite 

to give a clear yellow solution. The solution was acidified 

to a pH of 3 with 4 N hydrochloric acid. The precipitate was 

filtered and dried, in vacuo, over sicapent. Yield (3.02 g; 
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44.8%). X H-NMR (DMSO-d6) : d = 4.88 ppm (s,2H); 6.95 (s,2H); 
8.10 (s,lH). 

EXAMPLE 24 

2 -Amino- 6 -benzyloxy- 9 -carboxyme thy 1 purine ( 24 ) 

5 Sodium (2.0 g; 87.0 mmol) was dissolved in benzyl 

alcohol (20 ml) and heated to 130°C for 2 h. After cooling to 
0°C, a solution of 2-amino-6-chloro-9-carboxymethylpurine (23, 
4.05 g; 18.0 mmol) in DMF (85 ml) was slowly added, and the 
resulting suspension stirred overnight at 20°C. Sodium 

10 hydroxide solution (IN, 100 ml) was added and the clear 
solution was washed with ethyl acetate (3 x 100 ml) . The water 
phase then was acidified to a pH of 3 with 4 N hydrochloric 
acid. The precipitate was taken up in ethyl acetate (200 ml) , 
and the water phase was extracted with ethyl acetate (2 x 100 

15 ml; . The combined organic phases were washed with saturated 
sodium chloride solution (2 x 75 ml) , dried with anhydrous 
sodium sulfate, and taken to dryness by evaporation, in vacuo. 
The residue was recrystallized from ethanol (300 ml) . Yield 
afcer drying, in vacuo, over sicapent: 2.76 g (52%). M.p. 159- 

20 65°C. Anal. (calc, found) C<56.18; 55.97), H{4.38; 4.32), 
N(23.4; 23.10). X H-NMR (DMSO-d 6 ) : 4.82 ppm.(s,2H); 5.51 
(s,2H); 6.45 (s,2H); 7.45 (m,5H); 7.82 (s,lH). 

EXAMPLE 2 5 

N- ( [2-Amino-6-benzyloxy-purine-9-yl] -acetyl) -N- (2-Boc- 
25 aminoethyl) -glycine [BocGaeg monomer] (25) 

2 -Amino- 6 -benzyloxy- 9- carboxyme thyl -purine (24, 0.50 

g; 1.67 mmol), N' -Boc-aminoethyl glycine methyl ester (0.65 g; 
2. SO mmol), diisopropylethyl amine (0.54 g; 4.19 mmol), and 
bromo- tri s - pyrrolidino-phosphonium-hexaf luoro- phosphate 

30 (PyBroP®) (0.798 g; 1.71 mmol) were stirred in DMF (2 ml) for 
4 r. . The clear solution was poured into an ice-cooled solution 
of sodium hydrogen carbonate (1 N; 40 ml) and extracted with 
ethyl acetate (3 X 4 0 ml) . The organic layer was washed with 
pczassium hydrogen sulfate solution (IN; 2 X 40 ml) , sodium 

3 5 hydrogen carbonate (1 N; 1 X 4 0 ml) and saturated sodium 



* 
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15 



20 



ch'oride solution (60 ml) . After drying with anhydrous sodium 
sulfate and evaporation, in vacuo, the solid residue was 
re-rystallized from ethyl acetate /hexane (20 ml (2:1)) to give 
the methyl ester in 63% yield (MS-FAB 514 (M + l) . Hydrolysis 
was accomplished by dissolving the ester in ethanol/water (30 
ml (1:2)) containing cone, sodium hydroxide (1 ml). After 
stirring for 2 h, the solution was filtered and acidified to 
a P H of 3, by the addition of 4 N hydrochloric acid. The title 
compound was obtained by filtration. Yield: 370 mg (72% for 
the hydrolysis) . Purity by HPLC was more than 99%. Due to the 
li-ited rotation around the secondary amide several of the 
signals were doubled in the ratio 2:1 (indicated in the list 
by mj. for major and mi. for minor). 1 H-NMR(250, MHz, DMSO- 
d. : d - 1.4 ppm. (s,9H); 3.2 (m,2H); 3.6 (m,2H); 4.1 (a, mj . , 
OONRqfaCOOH) ; 4.4 (s, mi., CONRC&COOH) ; 5.0 (s, mi., Gua-C&CO- 
)• 5 2 (s. mj., Gua-C^CO); 5.6 (s,2H); 6.5 (s,2H); 6.9 (m, 
mi., BocNH) ; 7.1 (m, m j . , BocNH) ; 7.5 (m.,3H); 7.8 (s,lH); 12,8 
(s-lH). "C-NMR. 170.95; 170.52; 167.29; 166.85; 160.03; 
153 78; 155.84; 154.87; 140.63; 136.76; 128.49; 128.10; 113.04; 
78.19; 77.86; 66.95; 49.22; 47.70; 46.94; 45.96; 43.62; 43.31 
and 28.25. 



25 



30 



35 



EXAMPLE 26 

Methyl «-f ormylsuccinate, Figure 1 (26a) 

in a modification of the procedure of Fissekis and 
Sw-et Biochemistry 1970, 9, 3136-42, sodium methoxide (40.5 
g, 0.75 mol) was suspended in dry ether (500 ml) and stirred 
urd-r nitrogen at 0 »C. A mixture of dimethylsuccinate (65.4 
nf 0.50 mol) and methylf ormate (123 ml, 2.00 mol) was added 
d-oDwise over 30 min. The reaction mixture was stirred at 0 
"for 2 hours and then at room temperature overnight. 
Subsequently, the reaction mixture was evaporated to a viscous 
brown residue which was washed once with petroleum ether and 
th-n dissolved in 3 M hydrochloric acid (160 ml). This 
sc-ution was made weakly acidic with concentrated hydrochloric 
ac-d and then extracted with dichloromethane (4x250 ml) . The 
cyanic phase was dried (MgSOJ , filtered and evaporated under 
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reduced pressure. The resulting residue was distilled in a 
kugelrohr apparatus at 60 °C and 0.6 mBar yielding 52-3 g of 
a mixture of the title compound and dimethyl succinate in the 
molar ratio 80:20 (determined by NMR) as a colorless oil. This 
5 mixture can be used directly in the following preparation. The 
product can be isolated free of dimethyl succinate by 
exchanging the extraction with dichloromethane with a 
continuous extraction with diethyl ether. However, in our 
hands this reduced the yield to 34 %. Fissekis and Sweet, ibid, 
10 had reported a 62% yield. l H-NMR (DMSO-d 4 /TMS) : 6 = 3.20 (s, 
2H, CH 3 ) ; 3.59 (s, 3H, OMe) ; 3.61 (s, 3H, OMe) ; 7.73 (s, 1H, 
CHOH); 10.86 (br s, 1H, CHOH). U C-NMR (DMSO-d,/TMS) : 6 = 28.9 
(CH : ) ; 51.0 (OMe); 51.6 (OMe); 102.1 (C=CHOH) ; 156.6 (CHOH) ; 
168.3 (COO); 171.7 (COO). 

15 EXAMPLE 27 

Isocytosin-5-ylacetic acid (27) 

In a modification of the procedure of Beran et al . , 
Collect. Czech. Chem. Commun. 1983, 48, 292-8, sodium met hoxide 
(41.9 g, 0.78 mol) was dissolved in dry methanol (200 ml) and 

20 guanidine hydrochloride (49.4 g, 0.52 mol) was added. The 
mixture was stirred for 10 min under nitrogen at room 
temperature. A solution of methyl oe-£ ormylsuccinate (26, 30.0 
g, 0.17 mol) in dry methanol (100 ml) was added to the mixture. 
The reaction mixture was refluxed under nitrogen for 3 hours 

25 and then stirred at room temperature overnight. Subsequently, 
the reaction mixture was filtered, and the filter cake was 
washed once with methanol. The collected filtrate and washing 
were evaporated under reduced pressure. The resulting residue 
was dissolved in water (80 ml) and the solution was acidified 

30 with concentrated hydrochloric acid to pH 4.2. After having 
been stirred at 0 °C the mixture was filtered, the precipitate 
washed once with water and then freeze-dried leaving 28.29 g 
(97 %) of the title compound as a white solid. Calcd. for 
C ( H.NA 1/2H 2 0: C, 40.45; H, 4.53; N, 23.59. Found: C, 40.13; H, 

35 4.22; N, 23.26. 
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Due to the poor solubility properties of the product 
it was further characterized as its sodium salt. 27 (0.42 g, 
2.5 mmol) and sodium bicarbonate were dissolved in boiling 
water (35 ml) . The solution was cooled and evaporated. The 
5 residue was dissolved in water (6 ml) and ethanol (4 ml) and 
isopropanol (8 ml) were added. The sodium salt of 27 was 
collected by filtration, washed with abs. ethanol and petroleum 
ether and dried to yield 0.31 g (65 %) as white crystals. *H- 

NMR (D,0/TMS) : 6 = 3.10 (s, 2H, CH 3 COO) ; 7.40 (s, 1H, H-6) . "O 
10 NMR (DMS0-d,/TMS) : 6 = 34.8 (CH,C00) ; 112.0 (C-5) ; 145.6-146.5 

(m, C-2); 155.1 (C-6); 169.4 (C-4); 179.3 (COOH) . 
MS (FAB+) m/z {%) : 192 (100, M+H) . 

EXAMPLE 28 

Methyl isocytoain-5-ylacetate (28) 

15 Thionyl chloride (3.6 ml, 50 mmol) was added to 

stirred methanol (210 ml) at -40 °C under nitrogen. 
Isocytosin-5-ylacetic acid (27, 7.0 g, 41 mmol) was added and 
the reaction mixture was stirred at room temperature for 1 
hour, at 60 °C for 3 hours and overnight at room temperature. 

20 The reaction mixture was evaporated to dryness and the residue 
was dissolved in saturated aqueous sodium bicarbonate (80 ml) 
giving a foamy precipitate. 4 M hydrochloric acid was added 
to pH 6.5 and the suspension was stirred for 1 hour. The 
precipitate was collected by filtration, washed with water, re- 

25 crystallized from water and freeze-dried yielding 4.66 g (62 
%) of methyl isocytosin-5-ylacetate as white crystals. 

l H-NMR (DMSO-d,/TMS) : 6 = 3.28 (s, 2H, CH a COO) ; 3.64 (s, 3H, 
COOMe); 6.87 (br s, 2H, NH a ) ; 7.54 (s, 1H, H-6). "C-NMR (DMSO- 
djms) : 6 = 32.0 (CILCOO) ; 51,5 (COOMe); 108.4 (C-5); 153.3 (C- 
30 2); 156.4 (C-6); 164.0 (C-4) ; 171.8 (CH 2 C00) . MS (FAB+) m/z 
(%) : 184 (100, M+H) . Calcd. for CH,NjO, 3/2H 2 0: C, 40.00; H, 
5.75; N, 19.99. Found: C, 40.18; H, 5.46; N, 20.30. 
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EXAMPLE 29 

Methyl N-2- (benzyloxycarbonyl) isocytosin-5-ylacetate (29) 

Methyl isocytosin-5-ylacetate (28, 9.5 g, 52 mmol) 
was dissolved in dry DMF (95 ml) and the solution was stirred 
5 at 0 °C under nitrogen. N-Benzyloxycarbonyl-N' -methyl - 
imidazolium triflate (37.99 g, 104 mmol) was added slowly. The 
reaction mixture was stirred for 30 min at 0 °C and then 
overnight at room temperature, Dichloromethane (800 ml) was 
added and the resultant mixture was washed with half -saturated 

10 aqueous sodium bicarbonate (2x400 ml), half -saturated aqueous 
potassium hydrogen sulfate (2x400 ml) and with brine (1x400 
ml) . The organic phase was dried (MgS0 4 ) , . filtered and 
evaporated under reduced pressure. The residue was 
recrystallized from methanol affording 13.32 g (81 %) of the 

15 title compound as white crystals. *H-NMR (DMSO-d,/TMS) : 6 = 3.43 

(s, 2H, CH 2 COO) ; 3.67 (s, 3H, COOMe) ; 5.30 (s r 2H, PhOL) ; 7.43- 
7.52 (m, 5H, £hCHj; 7.77 (s, 1H, H-6) . "C-NMR (DMS0-d«/TMS) : 
6 = 31.9 (£HjCOO); 51.6 (COOMe); 67.0 (Ph£Hj ; 128.1-128.5 (m, 

PhCHj; 135.7 (PhCH 2 ) ; 150.7 (Z-CO) ; 170.8 (COO). 
20 MS (FAB+) m/z (%) : 318 (3.5, M+H) Calcd. for C 15 H l5 N 3 O s : C, 
56.78; H f 4.76; N, 13.24. Found: C, 56.68; H, 4.79; N, 13.28. 

EXAMPLE 30 

N-2 (Benzyloxycarbonyl) isocytosin-5-ylacetic acid (30) 

Methyl N-2 (benzyloxycarbonyl) isocytosin-5-ylacetate 
25 (29, 5.2 g, 16 mmol) was suspended in THF (52 ml) and cooled 
to 0 °C. 1 M lithium hydroxide (49 ml, 4 9 mmol) was added and 
the reaction mixture was stirred at 0 °C for 25 min. 
Additional 1 M lithium hydroxide (20 ml, 20 mmol) was added and 
the mixture was stirred at 0 °C for 90 min. The product was 
30 precipitated by acidifying to pH 2 with 1 M hydrochloric acid, 
collected by filtration, washed once with water and dried to 
yield 4.12 g (83 %) of white crystals. l H-NMR (DMS0-d tf /TMS) : 6 

= 3.33 (s, 2H, CH 3 C00) ; 5.29 (s, 2H, PhCHJ ; 7.43-7.52 (m, 5H, 
PhCH,) ; 7.74 (s, 1H, H-6); 11.82 (br s, 3H, exchangeable 
35 protons). MS (FAB+) m/z (%) : 304 (12, M+H) Calcd. for 
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C I4 H :1 HA: C, 55.45; H, 4,32; N, 13.86. Pound: C, 55.55; H, 4.46; 
N, 13.84. 



EXAMPLE 31 

Ethyl N- (2-BOC-aminoethyl) -N- (N-2 (benzyloxycarbonyl) isocytosin- 
5 5-ylacetyl)glycinate (31) 

N-2 (Benzyloxycarbonyl) isocytosin- 5 -ylacetic acid (30, 
2.0 g, 6.6 mmol) was transferred to a flask equipped with a 
stirring bar and a septum through which a flow of nitrogen was 
applied. Dry DMF (20 ml) and N-methylmorpholine (2.2 ml, 19.8 

10 mmol) were added. The mixture was cooled to 0 °C and ethyl N- 
( 2-BOC-aminoethyl) glycinate (1.8 g, 7.3 mmol) and 0- 
benzotriazol -1-yl -N,N, N ' , N ' -tetramethyluronium 
hexafluorophosphate (HBTU, 3.0 g, 7.9 mmol) were added. The 
reaction mixture was stirred under nitrogen for 4 h followed 

15 by addition of dichloromethane (100 ml) . The organic phase was 
washed with half -saturated aqueous sodium bicarbonate (2x75 
ml) , half-saturated aqueous potassium hydrogen sulfate (2x75 
ml) and with brine (1x75 ml), dried (MgSOj , filtered and 
evaporated under reduced pressure. The residue was dissolved 

20 in ethyl acetate (50 ml), stirred at 0 °C for 10 min and 
filtered through celite which was washed with ethyl acetate. 
The collected filtrate and washing were concentrated to a 
volume of 10 ml. Diethyl ether (100 ml) was added and the 
resultant solution was stirred overnight at room temperature. 

25 The product was collected by filtration, washed once with 
diethyl ether and dried to yield 2.6 g (74 %) of the title 
' compound as white crystals. Due to hindered rotation around 
the amide bond several of the NMR signals are comprised of a 
major (ma) and minor (mi) component. 'H-NMR (DMSO-d tf /TMS) : 6 

30 = 1.20-1.30 (m, 3H, CH^) ; 1.45 (s, 9H, BOC) ; 3.05-3.52 (m, 
6E, NCH, # CH,N, CH a C0N) ; 4.08 and 4.4 0 (s, ma and s, mi, 
respectively, 2H, CH 2 COO) ; 4.15 and 4,25 (q, ma, J=7 H2 and q, 
mi, respectively, 2H, OLCHJ ; 5.29 (s, 2H, PhCHJ ; 7.40-7.52 
(r., 5H # PhCHJ ; 7.64 and 7.67 (s, mi and s, ma, respectively, 

35 IE, H-6) . "C-NMR (DMS0-d«/TMS) : 6 = 14.1 (CHjCHj ; 28.2 (BOC) ; 
3C.2 and 30.5 (ma and mi, respectively, CH.CON) ; 37.9 and 38.3 
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(mi and ma, respectively, NCH a ) ; 47.7 and 48.0 (ma and mi, 
respectively, CH a N) ; 50.2 (CRCOO) ; 60.4 and 61.0 (ma and mi, 
respectively, CH,CH,) ; 67.0 (PhCHJ ; 127.9-128.5 (m, £hCH 3 ) ; 
135.8 (£kCH 3 ) ; 155.7 (C-6), 169.4 (CON); 170.0 (COO). MS 
5 (FAB+) m/2 (%): 532 (3.5, M+H) ; 432 (3.5, M-BOC+H) Calcd. for 
C„H„NA: C, 56.49; H, 6.26; N, 13.17. Found: C, 56.46; H, 6.14; 
N, 12.86. 

EXAMPLE 32 

N- (2-BOC-aminoethyl) -N- (N-2- (benzyloxycarbonyl) isocytos±n-5~ 
10 yl acetyl) glycine (32) 

Ethyl N- (2-BOC-aminoethyl) -N- (N-2- (benzyloxycarbon- 
yl) isocytosin- 5 -ylacetyl) glycinate (31, 1.6 g, 3.0 mmol) was 
dissolved in methanol (16 ml) by gentle heating. The solution 
was cooled to 0 °C and 2 M sodium hydroxide (23 ml) was added. 

15 The reaction mixture was stirred at room temperature for 75 min 
and then cooled to 0 °C again. The pH was adjusted to 1.7 and 
the product was collected by filtration, washed once with water 
and dried to give 1.24 g (82 %} of 32 as white crystals. 

Due to hindered rotation around the amide bond 

20 several of the NMR signals are comprised of a major (ma) and 
minor (mi) component. *H-NMR (DMSO-d,/TMS) : 6 = 1.45 (s, 9H, 

BOC); 3.05-3.52 (m, 6H, NCH 2 , CH 2 N, CH,CON) ; 4.01 and 4.29 (s, 
ma and s, mi, respectively, CH 3 COO) ; 5.29 (s, 2H, PhCHJ ; 7.40- 
7.51 (m, 5H, PhCH 3 ) ; 7.63 and 7.68 (s, mi and s, ma, 
25 respectively, 1H, H-6) . "C-NMR (DMSO-d,/TMS) : <5 = 28.2 (BOC); 

3 0.2 and 30.5 (ma and mi, respectively, CHACON); 37.9 and 38.3 
(mi and ma, respectively, NCHJ ; 47.5 and 47.9 (ma and mi, 
respectively, CH,N) ; 50.2 (C&COO) ; 67.0 (PhCH,) ; 128.0-128.5 
(m, PhCHJ ; 135.8 (PhCHj ; 150.5 (Z-CO) ; 155.7 (C-6); 169.9 and 
30 170.3 (ma and mi, respectively, CON); 170.8 and 171.2 (ma and 
mi, respectively, COO) MS (FAB+) m/z (%) : 504 (16, M+H); 448 
(3.5, M-t-Bu+H); 404 (23, M-BOC+H) 
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EXAMPLE 33 

Ethyl (2 -thiouracil-5-yl) acetate, Figure 2 (33) 

All operations were carried out in dry equipment 
under an atmosphere of nitrogen. Sodium (4.36 g, 190 mmol) was 
5 dissolved in abs. ethanol (440 ml). Thiourea (14.4 g, 190 
mmol) and methyl or-formylsuccinate (26, 30.0 g, 172 mmol) were 
added. The reaction mixture was refluxed for 6 hours and, 
subsequently, evaporated to dryness under reduced pressure. 
Cold 15 % aqueous acetic acid (300 ml) was added to the 

10 residue. The mixture was stirred at 0 °C overnight and 
filtered. The precipitate was washed once with water and dried 
to yield 12.29 g (37%) of the title compound as a white solid. 
l H-NMR (DMSO-d s /TMS) : d= 1.25 (t f 3H r J=7H2, CHJ ; 3.35 (s, 2H, 
CH ; COO) ; 4.13 (q, 2H, J=7 Hz, OCH 3 ) ; 7.52 (s, 1H, H-6) ; 12.35 

15 (br, 2H, exchangeable protons) . "C-NMR (DMSO-d,/TMS) : 6 = 14.1 

(CH,); 31.6 (CHXOO) ; 60.3 (OCH 3 ) ; 111.7 (C-5) ; 140 9 (C-6) , 
161.2 (C-4); 170.1 (C-2) ; 175.5 (COO). 

MS (FAB+) m/z (%) : 215 (57, M+H) . Calcd. for C,H lfl NAS: C, 
44.85; H, 4.70; N, 13.08; C/N, 3.43. Found: C, 42.95; H, 4.58; 
20 N, 12.89; C/N, 3.33. 

EXAMPLE 34 

Uracil- 5 -ylacetic acid (34) 

Ethyl (2-thiouracil-5-yl) acetate (33, 7.8 g, 36 mmol) 
was mixed with chloroacetic acid (1.9 g, 20 mmol) and water (47 

25 ml) and refluxed for 2 hours. Concentrated hydrochloric acid 
(22 ml) was added and the reaction mixture was refluxed 
overnight. The reaction mixture was filtered and the 
precipitate was washed once with water and dried. The 
procedure was repeated with the precipitate in place of 8 

30 yielding 4.19 g (68%) of the title compound as a white solid. 

: H-NMR (DMSO-d 6 /TMS) : 2= 3.13 (s, 2H, CH 2 COO) ; 7.35 (d, 1H, 
J=6.5 Hz, H-6); 10.74 (m, 1H, H-l) ; 11.09 (s, 1H, H-3) ; 12.20 
(br, 1H, COOH) . l3 C-NMR (DMSO-d,/TMS) : 6 = 31.3 (CILCOO) ; 106.6 
(C-5); 139.7 (C-6); 151.2 (C-2); 164.0 (C-4); 171.9 (COOH). 
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EXAMPLE 35 

Ethyl N- (2-BOC-amin thyl) -N- ( uracil- 5 -ylacetyl) glycinate (35) 

Uracil-5-ylacetic acid (34 , 1.0 g, 5.9 mmol) was 
transferred to a round bottomed flask equipped with a septum 
5 through which a flow of nitrogen was applied. Dry DMF (10 ml) 
and N-methylmorpholine (1.9 ml, 17.6 mmol) were transferred to 
the flask and the mixture was cooled to 0 °C. Ethyl N-(2-BOC- 
aminoethyl) glycinate (1.6 g, 6.5 mmol) and 0- (3 , 4-dihydro-4- 
oxo-1 , 2 , 3-benzotriazin-3-yl) -N,N,N" ,N"-tetramethyl uronium te- 

10 traf luoroborate (TDBTU, 2.5 g, 7.0 mmol) were added to the 
mixture. The reaction mixture was stirred for 3 hours at 0 °C 
and was then poured into ethyl acetate (300 ml) * The resultant 
suspension was washed with saturated aqueous potassium hydrogen 
sulfate (2x50 ml) , saturated aqueous sodium bicarbonate (2x50 

15 ml) and with brine (1x50 ml) . The organic phase was dried over 
magnesium sulfate, filtered and evaporated under reduced 
pressure. The residue was stirred overnight in dichloromethane 
(10 ml) and diethyl ether (40 ml) . The resultant precipitate 
was isolated by filtration, washed once with diethyl ether and 

20 dried to give 1.13 g (48%) of the title compound as white 
crystals . 

Due to hindered rotation around the amide bond 
several of the NMR signals are comprised of a major (ma) and 
minor (mi) component. l H-NMR (DMSO-d,/TMS) : 6 = 1.27 (m, 3H, 

25 CH,) ; 1.45 (s, 9H, BOC) ; 3.10-3.49 (m, CH,C0N, NCH a , CH,N and 
water); 4.07 (ma), 4.37 (mi) (s, 2H, CH 2 COO) ; 4.20 (m f 2H, 
OCH 2 ) ; 6.75 (mi), 6.95 (ma) (br, 1H, BOC-NH) ; 7.31 (s, 1H, H- 
6); 10.80 (br, 1H, H-l) ; 11.15 (br, 1H, H-3) . MS (FAB+) m/z 
(%; : 399 (29, M+H) ; 299 (100, M-BOC+H) . Calcd. f or C^H^N.O, : C, 

30 51.25/ H, 6.58; N, 14.06; C/N, 3.65. Found: C, 50.62; H, 6.51; 
N, 13.60; C/N, 3.72. 

EXAMPLE 36 

N- (2 -BOC -Amino thyl) -N- (uracil-5-ylacetyl) glycine (36) 

Ethyl N - (2-BOC-aminoethyl) - N - (uracil-5- 
35 ylacetyl) glycinate (35, 1.00 g, 2.5 mmol) was dissolved in 1 
M aqueous sodium hydroxide (50 ml) and the mixture was stirred 
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for 15 min at room temperature. The reaction mixture was 
cooled to 0 °C and the pH was adjusted to 1.5 by the addition 
of 2 M hydrochloric acid. After 30 min the aqueous solution 
was extracted with n-butanol (4x80 ml) . The n-butanol of the 
5 combined organic phases was evaporated under reduced pressure. 
Residual n-butanol was removed azeotropically with water (5x50 
ml) . The resultant aqueous solution was freeze -dried to yield 
0.S3 g (100%) of the title compound as a white solid. Due to 
hindered rotation around the amide bond several of the NMR 
10 signals are comprised of a major (ma) and minor (mi) component. 

l H-NMR (DMSO-4/TMS) : 6= 1.44 (s, 9H, BOC) ; 3.07-3.48 (m, 

CH : C0N, NCH 2 , CH a N and water); 4.00 (ma), 4.26 (mi) (s, 2H, 
CH : C00) ; 6.75 (mi), 6.94 (ma) (br, 1H, BOC-NS) ; 7.28 (mi) , 7.32 
(ma (d, 1H, J=5.5 Hz, H-6) ; 10.87 (br, 1H, H-l) ; 11.12 (br, 
15 1H, H-3). MS(FAB+) m/z (%) : 371 (26, M+H) . 

w 

Caicd. for C l5 H aj N 4 0 7 : C, 48.65; H, 5.99; N, 15.13; C/N, 3.22. 
Found: C, 35.13; H, 4.66; N, 10.48; C/N, 3.35, 

EXAMPLE 37 

N-2- (Benzyloxycarbonyl) isocytosine, Figure 3 (37) 

20 Isocytosin (5.0 g, 45 mmol) was dissolved in dry DMF 

(5C ml) by heating. The solution was cooled to 0 °C and N- 
benzyloxycarbonyl -N'- methyl imidazolium triflate (33 g, 90 mmol) 
was added slowly. The reaction mixture was stirred under 
nirrogen at 0 °C for 3 0 min and then overnight at room 

25 temperature. Dichloromethane (400 ml) was added and the 
organic phase was washed with half -saturated aqueous sodium 
bicarbonate (2x200 ml) , half -saturated aqueous potassium 
hydrogen sulfate (2x200 ml) and with brine (1x200 ml), dried 
(MgSOJ , filtered and evaporated under reduced pressure. The 

30 residue was recrystallized from methanol yielding 7.52 g (68 
%) of the title compound as white crystals. X H-NMR (DMSO- 
d,/TMS) : d= 5.28 (s, 2H, PhCHJ ; 6.00 (d, 1H, J=7.0 Hz, H-5) ; 
7.43-7.51 (m, 5H, PhCH a ) ; 7.77 {d, 1H, J=7.0 Hz, H-6); 11.57 
(s, 2H, exchangeable protons). u ONMR (DMSO-d,/TMS) : tf= 67.0 

35 (PhOL) ; 107.8 (C-5) ; 128.0-128.5 (m, 2&CHJ ; 135.8 (fhCHj ; 
151.9 (Z-CO) . MS ( FAB+ ) m/z (%) : 246 (15, M+H). 
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EXAMPLE 38 

Ethyl N- <2-BOC-aminoethyl) -N- (bromoacetyl) glycinate (38) 

N' -Boc-aminoethylglycine ethyl ester (8, ,5.95 g, 
24.2 mmol) was dissolved in dichloromethane (15 ml) and cooled 
5 to 0 °C. 3,4-Dihydro-3-hydroxy-4-oxo-l, 2 , 3-benzotriazine 
(DhbtOH, 4.34 g, 26.6 mmol), dicyclohexylcarbodiimide (DCC, 
5.98 g, 29,0 mmol) in dichloromethane (15 ml) and bromoacetic 
acid (3.69 g, 26.6 mmol) in dichloromethane (30 ml) were added. 
The reaction mixture was stirred at 0 °C for 100 min and then 

10 at room temperature for 100 min. It was then filtered and the 
precipitate was washed with dichloromethane (3x30 ml) ♦ The 
collected filtrate and washings were washed with saturated 
aqueous sodium bicarbonate (3x120 ml) , saturated aqueous 
potassium hydrogen sulfate (2x120 ml) and with brine (2x120 

15 ml). The organic phase was dried (Na 2 S0 4 ) , filtered and 
evaporated under reduced pressure. The residue was filtered 
through silica (30 g, EtOAc/petroleum ether 25:75, v/v until 
the fastest moving spots on TLC had been removed and then 
50:50, v/v to collect the product). The collected fractions 

20 were evaporated under reduced pressure to yield 8.29 g (93 %) 
of the title compound as a yellow oil. This oil was used in 
the next step without further purification. Due to hindered 
rotation around the amide bond several of the NMR signals are 
comprised of a major (ma) and minor (mi) component. ^-NMR 

25 (CDC1 3 /TMS) : 6 « 1.24-1.33 (m, 3H, CH,GHJ ; 1.43 and 1.44 (s, mi 
and s, ma, respectively, 9H, BOC) ; 3.28-3.32 (m, 2H, NCHJ ; 
3.54-3.56 (m, 2H, CH 2 N) ; 3.79 and 3.93 (s, mi and s, ma, 
respectively, CH 2 C0N) ; 4.02 and 4.19-4,26 (s, ma, CH,C00 and m, 
s, mi, C&CHj, CH 2 C00, respectively, 4H) . "C-NMR (CDC1 3 /TMS) : 6 

30 = 13.8 (CH 2 OL) ; 28.1 (BOC); 38.2 and 38.5 (mi and ma, 
respectively, NCH 2 ) ; 48.0 and 48.9 (mi and ma, respectively, 
CH ; N) ; 50.1 and 50.8 (ma and mi, respectively, CHXOO) ; 61.4 
(CHACON); 61.8 (CILCH,) . MS (FAB+ ) m/z (%) : 369 (10, M+2+H) ; 
367 (12, M+H) ; 313 (24, M- t-Bu+2+H) ; 311 (27, M-t-Bu+H); 269 

35 (67, M-BOC+2+H) ; 267 (75, M-BOC+H) 
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EXAMPLE 39 

Ethyl N- (2-BOC-anino thyl) - N - ( N - 2 - 
(benzyloxycarbonyl) isocytoein-l-yl- acetyl) glycinate (39) 

N-2 ( Benzyl oxycarbonyl) isocytosin (37, 2.00 g, 8.2 
5 mmol) was dissolved in dry DMF (15 ml) and potassium carbonate 
(1.69 g, 12.3 mmol) was added. The mixture was heated to 75 
°C for 30 min and then cooled to room temperature. Ethyl N- (2- 
BOC-aminoethyl)-N-(bromoacetyl) glycinate (38, 3.00 g, 8.2 mmol) 
in dry DMF (10 ml) was added and the reaction mixture was 

10 stirred overnight under nitrogen at room temperature. Water 
(150 ml) was added to the reaction mixture which was then 
extracted with dichloromethane (150 and 100 ml) . The organic 
phase was washed with saturated aqueous potassium hydrogen 
sulfate (3x100 ml) and with brine (2x100 ml) , dried (Na 2 S0 4 ) , 

15 filtered and evaporated under reduced pressure. The residue 
was chromatographed on silica (180 ml, EtOAc/n-hexane 1:1 v/v 
and then EtOAc) to yield 1.95 g (45 %) of the title compound 
as a white glassy foam. Due to hindered rotation around the 
amide bond several of the NMR signals are comprised of a major 

20 (ma) and minor (mi) component. 'H-NMR (CDC1,/TMS) : 6 = 1.22- 
1.29 (m, 3H, CH.CHJ ; 1.39 and 1.40 (s, ma and s, mi, 
respectively, 9H, BOC) ; 3.15-3.25 and 3.35-3.36 (m, mi and m, 
ma, respectively, 2H, NCHJ ; 3.50-3.53 (m, 2H, CH,N) ; 4.02 and 
4.27 (s, ma and s, mi, respectively, 2H, CH.COO) ; 4.09-4.23 (m, 

25 2H, CH.CHJ ; 4.48 and 4.71 (s, mi and s, ma, respectively, 2H, 
CH 2 C0N) ; 5.12 (s, 2H, PhCHj ; 5.80 (d, J=8 Hz, 1H, H-5) ; 7.19 
(d, J=8 Hz, 1H, H-6); 7.25-7.38 (m, 5H, PJiCH,) . 1J C-NMR 
' (CDClj/TMS) : 6 = 13.7 (CH,CHJ ; 28.0 (BOC); 38.4 (NCH,) ; 48.4 and 

48.9 (mi and ma, respectively, CH,N) ; 49.1 and 49.4 (ma and mi, 
30 respectively, CJLCON) ; 50.5 (CJLCOO) ; 61.3 and 61.8 (ma and mi, 
respectively, CJLCH,) ; 67.1 (PhCJL) ; 104.0 (C-5) ; 127.7-128.1 

(m, PhCHj; 135.8 (PhCH,) ; 144.9 (C-6); 153.9 (Z-CO) ; 159.4 (C- 
2); 161.9 (C-4); 166.4 (CON); 168.9 (COO). MS (FAB+) m/z (%) : 
532 (92, M+H) ; 476 (8, M- t-Bu+H) Calcd. f or C„H„N s O, : C, 56.49; 
H, 6.26; N, 13.17; C/N, 4.29. Found: C, 55.76; H, 6.54; N, 
12.64; C/N, 4.41. 



35 
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EXAMPLE 40 

N- (2-BOC-Aminoethyl) -N- (N-2 (b nzyloxycarbonyl) isocytosin-l-yl- 
ac e ty 1 ) glyc ine (40) 

Ethyl N- (2-BOC-aminoethyl) -N- (N 2 - ( benzyl oxy carbon - 
5 yl) isocytosin-l-ylacetyl) glycinate (39, 1.34 g, 2.5 mmol) was 
dissolved in THF (40 ml) at 0 °C. 1M aqueous lithium hydroxide 
(7.5 ml, 7.5 mmol) was added and the reaction mixture was 
stirred for 75 min at 0 °C. Water (50 ml) was added and the 
pH was adjusted to 3 with concentrated hydrochloric acid. The 

10 aqueous phase was extracted with EtOAc (2x70 ml) . The extracts 
were collected, dried (Na 2 S0 4 ) , filtered and evaporated under 
reduced pressure. The residue was dried in a desiccator over 
sicapent for 65 hours yielding 1.26 g (100 %) of product as a 
white glassy foam, 

15 Due to hindered rotation around the amide bond 

several of the NMR signals are comprised of a major (ma) and 
minor (mi) component. l H-NMR (DMSO- d f /TMS ) : 6 = 1.44 (s, 9H, 

BOO; 3.29-3.51 (m # 4H, NCH 2 and CH 2 N) ; 4.07 and 4.31 (s, ma and 
s, mi, respectively, 2H, CK,COO) ; 4.82 and 5*01 (s, mi and s, 
20 ma, respectively, 2H, CH 2 CON) ; 5.19 (s, 2H, PhCHJ : 5.99-6.01 
(m, 1H, H-5) ; 7.73 and 7.76 (d, mi, J=8 Hz and d, ma, J=8 Hz, 
respectively, 1H, H-6) ; 7.39-7.45 (m, 5H, PhCHj . l5 C-NMR 
(DMSO-d«/TMS) : 6 = 28.2 (BOC) ; 37.6 and 38.2 (mi and ma, 

respectively, NCH 2 ) ; 47.0 and 47.9 (mi and ma, respectively, 
25 CH ; N) ; 49.2 (OLCON) ; 49.4 and 49.6 (ma and mi, respectively, 
CH.COO) ; 66.8 and 66.9 (ma and mi, respectively, PhCHJ ; 103.2 
(C-5); 127.8-128.3 (m, PhCHJ: 136.5 (PhCHJ ; 147.4 (C-6) ; 154.1 
(Z-CO) ; 155.7 (BOC- CO) ; 159.5 (C-2) ; 162.2 (C-4); 166.5 and 

166.9 (ma and mi, respectively, CON); 170.3 and 170.7 (ma and 
30 mi, respectively, COO). MS (FAB+) m/z (%) : 504 (21, M+H) ; 448 
(6, M-t-Bu+H); 404 (11, M-BOC+H) ; 91 (100, PhCHJ 

EXAMPLE 41 

5-Bromouracil-N-l-m thyl acetate, Figure 4 (41) 

5-Bromouracil (5.00 g; 26.2 mmol) and potassium 
35 carbonate (7.23 g; 52.3 mmol) were suspended in DMF (75 ml). 
Methyl bromoacetate (2.48 ml; 26.1 mmol) was added over a 
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period of 5 rain. The suspension was stirred for 2 hours at 
room temperature, and then filtered. The solid residue was 
washed twice with DMF, and the combined filtrates were 
evaporated to dryness, in vacuo. The residue was an oil 
5 containing the title compound, DMF and some unidentified 
impurities. It is not necessary to purify the title compound 
before hydrolysis. *H-NMR (DMSO-d«, 250 MHz) ; 8.55 (impurity); 
8.27 (CBr«CHN); 8.02 (impurity); 4.76 (impurity); 4.70 
(impurity); 4.62 (NCH2COOCH3) ; 3.78 (COOCH 3 ) ; 2.96 (DMF); 2.80 
10 (DMF) . "C-NMR (DMSO-d fi , 250 MHz); 168.8 (COOCH 3 ) ; 172.5 
(CK=CBrCON); 161.6 (DMF); 151.9 (NCON); 145.0 (CO-CBr=CHN) ; 
95.6 (COCBr=CHN) ; 52.6 (impurity); 52.5 (OCH 3 ) ; 49.7 
(impurity); 48.8 (N£H 2 COOMe) ; 43.0 (impurity); 36.0 (DMF). 
UV(Methanol; ^nm) ; 226; 278. IR (KBr.-cm- 1 .; 3158s (_NH) ; 
15 1743VS (_O0, COOMe) ; 1701VS (_C=0, CONH) ? 1438VS (d CH, CH 3 0) ; 
122 3 vs (_ C-0, COOMe) ; 864 m id CH, Br=C-H) . FAB-MS m/z 
(assignment) : 265/263 (M+H) . 

EXAMPLE 42 

(5-Bromouracil) acetic acid (42) 

20 Water (30 ml) was added to the oil of the crude 

product from Example 41 and the mixture was dissolved by adding 
sodium hydroxide (2M, 60 ml) . After stirring at 0°C for 10 
min, hydrochloric acid (4M, 45 ml) was added to pH=2 and the 
title compound precipitated. After 50 min, the solid residue 

25 was isolated by filtration, washed once with cold water, and 
ther. dried in vacuo over sicapent. Yield: 2.46 g (38%). Mp, 
•250°-251°C. Anal, for C 6 H 5 BrN 2 0< . Found (calc): C: 28.78 
(23.94); H: 2.00 (2.02); Br: 32.18 (32.09); N: 11.29 (11.25). 
Hl-NMR (DMSO-d 6 , 250MHz): 12,55 (1H.S,C00H); 11.97 <1H,S,NH); 

30 8.30 (1H,S,C=C-H); 4.49 ( 2H , s , NCH 2 COOH) . "C-NMR (DMSO-d 6 , 250 
MHz!; 169.4 (COOH) ; 159.8 (NHCOCBr-CH) ; 150.04 (NCON) ; 145.8 
(CCCBr=CHN); 94.6 (COCBr=CHN) ; 48.8 (NCH 2 COOH) . UV (Methanol; 
^nm); 226;' 278. IR (KBr; cm" 1 ); 3187s (_NH) ; 1708vs 
7c=O,C0OH); 1687VS; 1654VS (_C=0, CONH); 1192s (_C-0, COOH) ; 

35 842 m (3 CH, Br-C=C-H) . FAB-MS m/z (assignment, 
in-ensity) ; 251/249 (M + H,5). 
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EXAMPLE 43 

N- ( Boc - amino e thy 1 ) -N- (5 -bromouracil-N-l-methylenecarbonyl) - 
glycine thyl ester (43) 

N' -Boc-aminoethylglycine ethyl ester (8, 1.80 g; 7.30 
5 mmol) was dissolved in DMF (10 ml). Dhbt-OH (1.31 g; 8.03 
mmol) was added, whereby a precipitate was formed. DMF (2 x 

10 ml) was added until the precipitate was dissolved. (5- 
Bromouracil) acetic acid (42, 2.00 g; 8.03 mmol) was added 
slowly to avoid precipitation. Methylene chloride (30 ml) was 

10 added, and the mixture was cooled to 0°C and then filtered. 
The precipitate, DCU, was washed twice with methylene chloride. 
To the combined filtrate was added methylene chloride (100 ml) . 
The mixture was washed with half saturated NaHC0 3 - solution (3 
x 100 ml, H 2 0 : saturated NaHC0 3 - solution 1:1 v/v) , then with 

15 dilute KHS0 4 - solution (2 x 100 ml, H 2 0: saturated KHS0 4 - solution 
4:1 v/v), and finally with saturated NaCl-solution (1 x 100 
ml) . The organic phase was dried over magnesium sulphate, 
filtered, and evaporated to dryness in vacuo (about 15 mmHg and 
then about 1 mmHg) . The residue was suspended in methylene 

20 chloride (35 ml), stirred for 45 min at room temperature, and 
filtered (the precipitate was DCU) . Petroleum ether (2 
volumes) was added dropwise to the filtrate at 0°C, whereby an 

011 precipitated. The liquor was decanted and the remaining 
oil dissolved in methylene chloride (20-50 ml) . Precipitated 

25 was effected by the addition of petroleum ether (2 volumes) . 
This procedure was repeated 5 times until an impurity was 
removed. The impurity can be seen at TLC with 10% MeOH/CH 2 Cl 2 
as the developing solvent. The resulting oil was dissolved in 
methylene chloride (25 ml) and evaporated to dryness in vacuo, 
30 which caused solidification of the title compound. Yield: 2.03 
g : (58%) . Mp. 87°-90°C. Anal, f or C 17 H 25 BrN 4 0 7 . Found (calc): 
C: 42.33 (42.78); H: 5.15 (5.28); Br: 17.20 (16.74); N: 1.69 
(11.74). 'H-NMR (DMSO-d 6 , 250 MHz, J in Hz): 1.93 & 11.92 
(1K,S,C=0NHC=0) ; 8.09 & 8.07 ( 1H, s , C=C-H) ; 7.00 & 6.80 
35 (IK, t , b, BocNH) ; 4.80 & 4.62 ( 2H , s , NCH 2 CON) ; 4.35 & 4.24 
( 2H , S , NCH 2 COOEt ) ; .4.27-4.15 (2H,m's / COOCH 2 CH 3 0); 3.47-3.43 
(2H # m / S, BocNHCH 2 CH 2 N) ; 3.28-3.25 & 3.12-3.09 
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(2H,m's,BocNHCH2CH- 3 N) : 1.46 & 1.45 OH^^Bu); 1,26 & 1.32 
(3H,t,J=7.1, COOCH 2 CH 3 ) - 13 C-NMR (DMS0-d 6 , 250 MHz) ; 169.3 & 
169.0 ('BuOOO); 167.4 & 167.1 (COOEt); 159.8 (OC-CON) ; 155.9 
(NCH 2 CON) ; 150.4 (NCON); 145.9 (COCBr-CHN) ; 94.5 (C0CBr=CHN) ; 
5 78.2 (Me^C) ; 61.3 & 60.7 (COCH 2 CH 3 ) ; 49.1 & 48.0 (N£H 2 COOH) ; 

48.0 & 47.0 (NCH 2 CON) ; 3 8.6 (BocNHCH 2 CH 2 N) ; 38.2 (BocNHCH 2 CH 2 N) ; 
26.3 (C(CH 3 ) 3 ); 14.1 (COCH^Hj) . UV (Methanol; ^ NM) : 226; 
280. IR (KBr, CM* 1 ): 3200ms, broad (_NH) ; 168vs, vbroad 
LOO, COOH, CONH) ; 1250s (_ c " 0 ' COOEt) ; 1170s (_C-0, COO c Bu) ; 

10 859m (d CH, Br-C=C-H) . FAB-MS m/z (assignment, relative 
intensity): 479/477 (M + H, 5); 423/421 (M + 2H - 'Bu, 8); 
379/377 <M + 2H - Boc, 100); 233/231 (M - backbone, 20). 

EXAMPLE 44 

N- (Boc-aminoethyl) -N- ( 5-bromouracil-N- 1-methylenecarbonyl) - 
15 glycine (44) 

N * (Boc-aminoet hyl ) - N - ( 5 - br omour aci 1 - N - 1 - 
methylenecarbonyl) ethyl ester (43, 1.96 g; 4.11 mniol) was 
dissolved in methanol (30 ml) by heating, and then cooled to 
0°C. Sodium hydroxide (2M, 30 ml) was added, and the mixture 

20 was stirred for 30 min. HCl (1M, 70 ml) was added to pH = 2.0. 
The water phase was extracted with ethyl acetate (3 x 65 ml + 
7 x 40 ml) . The combined ethyl acetate extractions were washed 
with saturated NaCl-solution (500 ml) . The ethyl acetate phase 
was dried over magnesium sulphate, filtered and evaporated to 

25 dryness in vacuo. Yield: 1.77 g (96%). Mp. 92°-97°C. Anal, for 
C 1£ H 21 BrN 4 0 7 . Found (calc): C: 40.79 (40.10); H: 5.15 (4.71); 
Br; 14.64 (17.70); N: 11.35 (12.47). *H-NMR (DMSO-d 6 , 250 MHz, 
J in Hz): 12.83 <1H, s , COOH) ; 11.93 & 11.91 (1H, s, C=ONHC=0) ; 
8.10 & 8.07 (1H,S,C=C-H); 7.00 & 6.81 (1H, t , b, BocNH) ; 4.79 & 

30 4.61 (2H,s,NCH 2 CON) ; 4.37 & 4.25 (2H, S , NCH 2 COOH) ; 3.46-3.39 
(2H,m's, BocNHCH 2 CH 2 N) ; 3.26-3.23 & 3.12-3.09 (2H,m's, 
BocNHCHjO^N) ; 1.46 OH^s^Bu) . i3 C-NMR 9DMSO-d 6 ,250 MHz) ; 170.4 
( c BuOC=0) ; 166.9(COOH); 159.7 (C=C-C0N) ; 155.8 (NCH 2 CON) ; 150.4 
(NCON); 145.9 (C0CBr«CHN); 94 . 4 (COCBr-CHN) ; 78.1 (Me 3 C) ; 49.1 

35 & 48.0 (NCH 2 COOH> ; 47.7 & 47.8 (NCH 2 C0N) ; 38.6 (BocNHC 2 CH 2 N) ; 

38.1 (Boc NHCH 2 CH 2 N) ; 28.2 (C(CH 3 ) 3 ). UV (Methanol; max nm) ; 226; 
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278. IR (KBr^m" 1 ) : 3194ms, broad (_NH) ; 1686vs, vbroad (_C=0 
COOH, CONH) ; 1250s (_C-0,COOH); 1170s (_C-0, COO c Bu) ; 863m (d 
CH, Br-C=C-H) . FAB-MS m/z (assignment, relative intensity) : 

449/451 (M + H # 70); 349/351 (M + 2H -Boc, 100); 231/233 (M - 
5 backbone, 20) . 

EXAMPLE 45 

Synthesis of PNA Oligomers by Solid Phase, General Procedur 

The functionalized resin is measured out to typically 
provide 0.1-1.0 millimoles of functionality, (functionalities 

10 attached to resins are commercially available through various 
sources e.g. Peptides International, Kentucky). This weight 
of resin is suspended in a 1:1 (v:v) dichloromethane : dimethyl - 
formamide solution (5mL/lgm of resin) and allowed to swell for 
a period of time if desired. The solvent is then removed by 

15 filtration and the resin resuspended in trif luoroacetic acid 
(lmL/lgm of resin) and shaken for 2 minutes. The trif luoro- 
acetic acid is removed by filtration and this step is repeated 
once. The resin is washed three times with a solution of 1:1 
(v:v) dichloromethane :dimethylformamide. The resulting resin 

2 0 is resuspended in pyridine solution (5mL/lgm of resin) and 
vacuum filtered to remove the pyridine. This step is repeated 
once. This is followed by resuspension followed by filtration 
(designated "washing") using 1:1 (v:v) dichloromethane : di- 
methyl formamide solution (5mL/lgm of resin) this washing step 

25 is repeated twice. The resin is suspended in 1:1 (v:v) 
pyridine : dimethyl formamide and to this suspension is added the 
desired PNA monomer (2-10 molar equivalents), TBTU (1.9-9.9 
molar equivalents) , and di-isopropylethylamine (5-20 molar 
equivalents) such that the final concentration of PNA monomer 

30 is 0.2M. The suspension is shaken for 15-60 minutes and the 
spent coupling solution is removed by filtration. The resin 
is washed with pyridine three times, and any unreacted amines 
are capped using Rapoport's Reagent, 5 equivalents in DMF for 
5 minutes. The resin is then washed three times with pyridine 

35 followed by three washes with a solution of 1:1 (v:v) dichloro- 
methane: dimethyl formamide (5mL/lgm of resin) . At this point, 
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the resin is ready for the next coupling reaction and this 
procedure is repeated until the desired PNA is assembled on the 
resin. 



10 



Specific Examples of Amino Ethyl Glycine (aeg-) PNAs and aeg 
PNA Derivatives Prepared by this General Method 



Resin Employed 



Merrif ield 

Lys Substituted 
Merrif ield 

MBHA 



aeg* PNA /aeg -PNA Derivative 
Prepared 

H 2 N-GCAT-COOH {SEQ ID NO:l) 
H 2 N- GCAT -Lys -COOH (SEQ ID NO : 2 ) 



H 2 N-GCAT-CONH 2 (SEQ ID NO: 3) 



Lys Substituted MBHA H 2 N- GCAT- Lys -CONH 2 

NO: 4) 



(SEQ 



ID 



EXAMPLE 46 
15 Capping of the PNA 

PNA can be capped by a non-PNA moiety on the N 
terminus by following the procedures described in Example 45 
and substituting a desired carboxylic acid-based capping 
reagent for the PNA monomer in the final coupling step. 



2 0 Specific Examples of (aeg) PNAs and 
Prepared by this General Method 



25 



30 



35 



aeg PNA Derivatives 



Resin Employed 



aeg -PNA/ aeg* PNA Derivative 
Prepared 



Capping Reagent = Acetyl 



Merrif ield 



Lys Substituted 
Meriif ield 

Merrif ield 



CH3CONH-GCAT- COOH (SEQ ID NO: 5) 
H 2 N-GCAT-COOH (SEQ ID NO: 6) 

H 2 N- GCAT - Lys - COOH (SEQ ID NO: 7) 



H,N-GCAT-CONH 2 (SEQ ID NO: 8) 



Lys Substituted MBHA H 2 N- GCAT- Lys -CONH 2 (SEQ ID 

NO: 9) 



Lys Substituted 
Merrif ield 



CH 3 CONH-GCAT-Lys-COOH (SEQ ID 
N0:10) 

H 2 NGCAT-COOH (SEQ ID NO: 11) 
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10 



Lys Substituted 
Merrif ield 

Merrif ield 

MBHA 



H 2 N-GCAT-Lys-COOH (SEQ ID NO: 12) 



H 2 N-GCAT-CONH 2 (SEQ ID NO: 13) 
H 2 N-GCAT-CONH 2 (SEQ ID NO: 14) 



Lys Substituted MBHA H 2 N-GCAT-Lys-CONH 2 (SEQ ID 

NO:15) 



MBHA 



CH 3 CONH-GCAT-CONH 2 (SEQ ID 
NO:16) 

H 2 N-GCAT-CONH 2 (SEQ ID NO: 17) 



Lys Substituted MBHA CH 3 CONH-GCAT-Lys-CONH 2 (SEQ ID 

NO: 18) 



Capping Reagent = N-Boc glycine 



15 



20 



25 



Merrif ield 



Lys Substituted 
Merrif ield 

MBHA 



BocGly-GCAT-COOH (SEQ ID 
NO:19) 

BocGly-GCAT-Lys-COOH(SEQ ID 
NO: 20) 

BocGly-GCAT-CONH 2 (SEQIDNO:21) 



Lys Substituted MBHA BocGly-GCAT-Lys-CONH 2 (SEQ ID 

NO:22) 



Capping Reagent = 1. Glycine; 2. Cholic Acid (Choi) 



Merrif ield 



Lys Substituted 
Merrif ield 

MBHA 



Chol-GlyGCAT-COOH (SEQ ID 
NO:23) 

Chol-GlyGCAT-Lys-COOH (SEQ ID 
NO: 24) 

Chol-GlyGCAT-CONH 2 (SEQ ID 
NO: 25) 



Lys Substituted MBHA Choi -GlyGCAT- Lys -CONH 2 (SEQ ID 

NO: 26) 



EXAMPLE 47 

3 0 Lys /Aha linked Bis aeg-PNA preparation 

H-Gly-TTC-TCT-CTC-T-Lys-Aha-Iiys-Aha-Lys-T-CTC-TCT-CTT-Lys-NH 3 
(SEQ ID NO:27) 

The first ten aeg-PNA monomeric units were coupled by 
coupling an aeg-T monomeric unit to a lysine-MBHA resin via 
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standard solid phase methods as per the procedures of Example 
45, using TBTU activation resulting in a resin-bound PNA 
monomer containing amino terminal t-butyloxycarbonyl (BOC) 
protection. In an iterative process the other 9 aeg-PNA 
5 monomeric units were coupled. The terminal aeg-PNA contains 
an amino terminal t-butyloxycarbonyl (BOC) protection group. 

The support was washed four times with N,N-dimethyl- 
formamide/ dichloromethane (1:1) and then treated twice with 
5% m-cresol in trif luoroacetic acid (3 mL) with shaking for 

10 three minutes each time. The support was washed again with N,N- 
dimethylformamide/ dichloromethane (1:1) and then with 
pyridine . To a vial was added t-butyloxycarbonyl -N- e - (2 - 
chlorobenzyloxycarbonyl) -L- lysine (200 mmoles) and O- 
(benzotriazol -1 -yl ) -1,1,3, 3 - tetramethyluronium tetra- 

15 fluoroborate (180 mmoles) . N,N-Dimethylf ormamide (1 mL) and 
pyridine (1 mL) were added to the vial followed by N,N- 
diisopropylethylamine (400 mmoles) * The vial was shaken until 
all solids were dissolved. After one minute the contents of 
the vial were added to the peptide synthesis vessel and shaken 

20 for 20 minutes. The reaction solution was then drained away 
and the support washed five times with pyridine. Remaining 
free amine was capped by addition of a 10% solution of N- 
benzyloxycarbonyl-N' -methyl -imidazole trif late in N,N- 
dimethylformamide (1.5 mL) . After shaking for five minutes, 

25 the capping solution was drained and the support washed five 
times with pyridine. 

The remainder of the linker was prepared by the 
sequential coupling (as above) of N- t-butyloxycarbonyl- e -amino- 
hexanoic acid, t-butyloxycarbonyl -N- € - (2-chlorobenzyl- 

30 oxycarbonyl) -L-lysine, N- t-butyloxycarbonyl- e-aminohexanoic 
acid, and t-butyloxycarbonyl -N-e- (2 -chlorobenzyloxycarbonyl) -L- 
lysine. The resulting oligomer consisting of a decamer aeg-PNA 
containing an amino terminal linker with a t-butyloxycarbonyl 
cap was then extended for the remaining 10 PNA units again via 

35 standard solid phase methods. Cleavage off of the support and 
HPLC purification was as described for standard PNA oligomers. 
The title compound was determined by electrospray mass 
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spectrometry to have the expected molecular weight of 6012 
daltons. The thermal stability of the PNA/DNA triplex formed 
by this bis-PNA and its target was found to be greater than the 
PNA/DNA triplex formed by the corresponding single PNA and 
5 target (Tm = 89 °C for bis vs. 85 °C for single). Mass 
spectrometry also demonstrated that the bis-PNA formed a 1:1 
complex with target while the single PNA formed a 2:1 complex 
with its target. 



EXAMPLE 47a 

10 Lye/Amino Cie-Hexenoic Acid linked Bis aeg-PNA preparation 

H-Gly-TTC-TCT-CTC-T-Lys-Achea-Lys-Achea-Lys-T-CTC-TCT-CTT-Lys- 

NH 2 (Achea=Amino Cie-hexenoic acid) (SEQ ID NO: 28) 

The first ten aeg-PNA monomeric units are coupled by 
coupling an aeg-T monomeric unit to a lysine-MBHA resin via 

15 standard solid phase methods as per the procedures of Example 
45, using TBTU activation resulting in a resin-bound PNA 
monomer containing amino terminal t-butyloxycarbonyl (BOC) 
protection. In an iterative process the other 9 aeg-PNA 
monomeric units are coupled. The terminal aeg-PNA contains an 

20 amino terminal t-butyloxycarbonyl (BOC) protection group. 

The support is washed four times with N,N-dimethyl- 
formamide/dichloromethane (1:1) and then treated twice with 5% 
jn-cresol in trif luoroacetic acid (3 mL) with shaking for three 
minutes each time. The support is washed again with N,N- 

25 dimethyl formamide/ dichloromethane (1:1) and then with 
pyridine. To a vial is added t-butyloxycarbonyl -N-e- (2- 
chlorobenzyloxycarbonyl) -L- lysine (200 mmoles) and O- 
(benzotriazol-l-yl) -1,1,3 , 3 - tetramethyluronium tetra- 
fluoroborate (180 mmoles) . N, N-Dimethyl formamide (1 mL) and 

3 0 pyridine (1 mL) were added to the vial followed by N,N- 
diisopropylethylamine (400 mmoles) . The vial is shaken until 
all solids are dissolved. After one minute the contents of the 
vial are added to the peptide synthesis vessel and shaken for 
20 minutes. The reaction solution is then drained away and the 
3 5 support washed five times with pyridine. Remaining free amine 
is capped by addition of a 10% solution of N-benzyloxycarbonyl- 
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N' -methyl -imidazole triflate in N,N-dimethylf ormamide (1.5 mL) . 
After shaking for five minutes, the capping solution is drained 
and the support washed five times with pyridine. 

The remainder of the linker is prepared by the 
5 sequential coupling (as above) of N- t-butyloxycarbonyl -e-amino- 
hexenoic acid, t-butyloxycarbonyl -N-e- (2-chlorobenzyl- 
oxycarbonyl ) -L- lysine , N- t-butyloxycarbonyl - e -aminohexenoic 
acid, and t-butyloxycarbonyl -N-e- (2-chlorobenzyloxycarbonyl) -L- 
lysine. The resulting oligomer consisting of a decamer aeg-PNA 
10 containing an amino terminal linker with a t-butyloxycarbonyl 
cap is then extended for the remaining 10 PNA units again via 
standard solid phase methods. Cleavage off of the support and 
HPLC purification is as described for standard PNA oligomers. 

EXAMPLE 47b 

15 Lys /Amino Hexynoic Acid linked Bis aeg-PNA preparation 

H-Gly-TTC-TCT-CTC-T-Lys-Ahya-Lys-Ahya-Lys-T-CTC-TCT-CTT-Lys-NHj 
(AhyaoAmino hexynoic acid) (SEQ ID NO: 29) 

The first ten aeg-PNA monomeric units are coupled by 
coupling an aeg-T monomeric unit to a lysine-MBHA resin via 

20 standard solid phase methods as per the procedures of Example 
45, using TBTU activation resulting in a resin-bound PNA 
monomer containing amino terminal t-butyloxycarbonyl (BOO 
protection. In an iterative process the other 9 aeg-PNA 
monomeric units are coupled. The terminal aeg-PNA contains an 

25 amino terminal t-butyloxycarbonyl (BOC) protection group. 

The support is washed four times with N,N-dimethyl- 
formamide/dichloromethane (1:1) and then treated twice with 5% 
m-cresol in trif luoroacetic acid (3 mL) with shaking for three 
minutes each time. The support is washed again with N,N- 

30 dimethylf ormamide/ dichloromethane (1:1) and then with 
pyridine. To a vial is added t-butyloxycarbonyl -N-e- (2- 
chlorobenzyloxycarbonyl ) - L- lysine (200 mmoles) and O- 
(benzotriazol-l-yl) -1,1,3, 3 - tetramethyluronium tetra- 
fiuoroborate (180 mmoles) . N, N-Dimethylf ormamide (1 mL) and 

3 5 pyridine (1 mL) are added to the vial followed by N,N- 
diisopropylethylamine (400 mmoles) . The vial was shaken until 
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all solids are dissolved. After one minute the contents of the 
vial are added to the peptide synthesis vessel and shaken for 
20 minutes. The reaction solution is then drained away and the 
support washed five times with pyridine. Remaining free amine 
5 is capped by addition of a 10% solution of N- benzyl oxycarbonyl- 
N' -methyl -imidazole triflate in N,N-dimethylf ormamide (1.5 mL) . 
After shaking for five minutes, the capping solution is drained 
and the support washed five times with pyridine. 

The remainder of the linker is prepared by the 

10 sequential coupling (as above) of N- t-butyloxycarbonyl -e -amino - 
hexynoic acid, t-butyloxycarbonyl -N-€- (2-chlorobenzyl- 
oxycarbonyl ) -L-lysine , N- t-butyloxycarbonyl - e -aminohexynoic 
acid, and t-butyloxycarbonyl -N- 6- (2-chlorobenzyloxycarbonyl) -L- 
lysine. The resulting oligomer consisting of a decamer aeg-PNA 

15 containing an amino terminal linker with a t-butyloxycarbonyl 
cap is then extended for the remaining 10 PNA units again via 
standard solid phase methods. Cleavage off of the support and 
HPLC purification is as described for standard PNA oligomers. 

EXAMPLE 47c 

2C Lys /Me ta- Amino Benzoic Acid linked Bis aeg-PNA preparation 

H-Gly-TTC-TCT-CTC-T-Lys-MABA-Lys-MABA-Lys-T-CTC-TCT-CTT-Lys-NH 2 

(SEQ ID NO: 30) (MABAeMeta -Amino Benzoic Acid) 

The first ten aeg-PNA monomer ic units are coupled by 
coupling an aeg-T monomeric unit to a lysine-MBHA resin via 
25 standard solid phase methods as per the procedures of Example 
45, using TBTU activation resulting in a resin-bound PNA 
■ monomer containing amino terminal t-butyloxycarbonyl (BOC) 
protection. In an iterative process the other 9 aeg-PNA 
monomeric units are coupled. The terminal aeg-PNA contains an 
3 0 amino terminal t-butyloxycarbonyl (BOC) protection group. 

The support is washed four times with N,N-dimethyl- 
formamide/dichloromethane (1:1) and then treated twice with 5% 
m-cresol in trif luoroacetic acid (3 mL) with shaking for three 
minutes each time. The support is washed again with N,N- 
35 dimethyl f ormamide/ dichloromethane (1:1) and then with 
pyridine. To a vial is added t-butyloxycarbonyl -N- €- (2- 
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chlorobenzyloxycarbonyl) -L-lysine (200 mmoles) and 0- 
(benzotriazol-l-yl) -1,1,3, 3 -tetramethyluronium tetra- 
fluoroborate (180 mmoles) . N, N-Dimethylf ormamide (1 mL) and 
pyridine (1 mL) are added to the vial followed by N,N- 
5 di isop ropy 1 ethyl amine (400 mmoles) . The vial was shaken until 
all solids are dissolved. After one minute the contents of the 
vial are added to the peptide synthesis vessel and shaken for 
20 minutes. The reaction solution is then drained away and the 
support washed five times with pyridine. Remaining free amine 

10 is capped by addition of a 10% solution of N-benzyloxycarbonyl - 
N' -methyl -imidazole triflate in N,N-dimethy If ormamide (1.5 mL) . 
After shaking for five minutes, the capping solution is drained 
and the support washed five times with pyridine. 

The remainder of the linker is prepared by the 

15 sequential coupling (as above) of N- t-butyloxycarbony-meta- 
aminobenzoic acid, t-butyloxycarbonyl-N-e- (2 -chlorobenzyl- 
oxycarbonyl) -L-lysine, N- t-butyloxycarbony-meta- aminobenzoic 
acid, and t-butyloxycarbonyl-N-e- (2 -chlorobenzyloxycarbonyl) -L- 
lysine. The resulting oligomer consisting of a decamer aeg-PNA 

20 containing an amino terminal linker with a t- butyl oxycarbonyl 
cap is then extended for the remaining 10 PNA units again via 
standard solid phase methods. Cleavage off of the support and 
HPLC purification is as described for standard PNA oligomers. 

EXAMPLE 48 

25 Lys/Aha linked Bis aeg-PNA preparation 

H-Gly-TCT-TTT-Lys-Aha-Lya -Aha-Lys-TTT-TCT-TTT-Lys-CONH 2 (SEQ ID 
NO: 31) 

The title Lys/Aha linked aeg-PNA was synthesized as 
per the procedures of Example 47 except the polystrene 
3 0 polyethylene glycol copolymer resin "Tentagel Resin" was used 
as the synthetic support. 
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EXAMPLE 49 

Lys/Aha linked Bie aeg-PNA preparation H-Gly-TTT-TGT-TTT-Lys- 
Aha-Lys-Aha-Lys-TTT-TCT-TTT-Lys-CONH a (SEQ ID NO: 32) 

The title Lys/Aha linked aeg-PNA was synthesized as 
5 per the procedures of Example 47 except the polystrene 
polyethylene glycol copolymer resin "Tentagel Resin" was used 
as zhe synthetic support. 

EXAMPLE 50 

Lys/Aha linked Bis aeg-PNA preparation 
10 H-Gly-TTT-TCT-TTT-Lys-Aha-Lys-Aha-Lys-TTT-TCT-TTT-Lys-CONH 2 

(SEQ ID NO:33) 

The title Lys/Aha linked aeg-PNA was synthesized as 
per the procedures of Example 47 except the polystrene 
polyethylene glycol copolymer resin "Tentagel Resin" was used 
15 as rhe synthetic support. 

EXAMPLE 51 

Lys/Aha linked Bis aeg-PNA Having Pseudoisocytosine (J) 
H-Gly-TTJ-TJT- JTJ-T-Lys-Aha-Lys-Aha-Lys-T-CTC-TCT-CTT-Lys-NH 2 
(SEQ ID NO: 34) 

20 The title Lys/Aha linked bis aeg-PNA is synthesized 

as per the procedures of Example 47 . Pseudoisocytosine 11 (J) " 
mcr.omeric units are synthesized as per the procedures of 
examples 26 thru 32 and are used in place of cytosine monomeric 
uni-s to give the title compound. 

25 EXAMPLE 52 

Lys/Aha linked Bis aeg-PNA Having Pseudouracil (W) 
H-Gly-G^UA-GAW-JAC-^U-Lys-Aha-Lys-Aha-Lys-GUA-GAU-CAC-U-Lys- 
NH ; (SEQ ID NO:35) 

The title Lys/Aha linked bis PNA is synthesized as per 
30 the procedures of Example 47. Pseudouracil "*U" monomeric 
ur.izs are synthesized as per the procedures of examples 33 thru 
36 and are used in place of some of the uracil monomeric units 
tc give the title compound. 
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EXAMPLE 53 

Lys/Aha linked Bie aeg-PNA Having isocytosine (iC) 
H-Gly-TT±C-TiCT-iCTiC-T-Lys-Aha-Lys ~Aha-Lys-T-CTC-TCT-CTT-Lys- 
NH 2 (SEQ ID NO:36) 

5 The title Lys/Aha linked bis aeg-PNA is synthesized 

as per the procedures of Example 47. Aeg- isocytosine "iC" 
monomeric units are synthesized as per the procedures of 
Examples 37 thru 40 and are used in place of some of the aeg- 
cytosine monomeric units to give the title compound. 

10 EXAMPLE 54 

Lys/Aha linked Bis aeg-PNA Having 5-Bromouracil (5BrU) 

H - Gly - G5 BrUA - GA5BrU - JAC - SBrU-Lys - Aha - Lys - Aha - Lys - GUA-GAU- CAC-U- 

Lys-NH, (SEQ ID KO:37) 

The title Lys/Aha linked bis aeg-PNA is synthesized 
15 as per the procedures of Example 47. aeg- 5 -bromo- uracil 
monomeric units are synthesized as per the procedures of 
Examples 41 thru 44 and are used in place of some of the aeg- 
uracil monomeric units to give the title compound. 

EXAMPLE 55 
20 egl linked Bis aeg-PNA Preparation 

H-TCT-CTT-T-egl-egl-egl-TTT-CTC-T-Lys-NH 2 (SEQ ID NO: 38) 
(egl = -NH-CH a -CH 2 -0-CH 2 -CH a -0-CH 2 -C(*0) -) 

The protected bis aeg-PNA was assembled on a Boc- 
Lys(ClZ) functional ized MBHA resin with a substitution of 

25 approximately 0.10 mmol/g. The synthesis was initiated on 200 
mg (dry weight) of t-Boc-Lys (C1Z) -MBHA resin, preswollen 
overnight in dichloromethane . The following steps were 
repeated until the desired sequence was obtained: (1) removal 
of the N-terminal t-Boc protecting group by treatment with 95:5 

30 TFA/m-Cresol (2x4 min, 1 ml); (2) wash with 1:1 
DMF/dichloromethane (3xlmin, 1 ml) ; (3) wash with pyridine 
(2xlmin,l ml); (4) HBTU (18.0 mg, 0.48 mmol) and monomer (0.5 
mmol, t-Boc-C z OH (25.1 mg) , t-Boc-T-OH (19.2 mg) or t-Boc-egl- 
OH (13.1 mg) ) was taken up in 1:1 DMF/pyridine (in the case of 

3 5 t-Boc-egl-OH neat DMF was used) and added DECA (16 ml, 1 mmol) 
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to a final volume of 0.5 ml and the mixture was allowed to 
preactivate for 1 minute before addition to the resin where the 
coupling was allowed to proceed for 20 min at room temperature; 
(5) a few beads were removed for qualitative Kaiser test 
5 (Ninhydrin) ; (6) Wash with pyridine (2x1 min, 1 ml) ; (7) 
acylation with Rappoport's reagent (lOOmg, 0.28 mmol) in DMF 
(1 ml); (8) Wash with 8:2 DMF/pipiridine; (9) wash with 
pyridine (3x1 min, 1 ml); (10) Wash with 1:1 
DMF /dichlorome thane (3xlmin, 1 ml) . 

10 When the desired sequence was obtained the resin was 

washed with neat dichloromethane (3x1 min # 1.5 ml) and then 
dried in a desiccator under vacuum. All qualitative Kaiser- 
tests were yellow with no coloration of the beads. 

The bis aeg-PNA was cleaved from the resin and the 

15 permanent protection groups were removed. A solution of 1:8:1 
TFA/DMS/m-cresol (50 fih) and a solution oJE 9:1 TFA/TFMSA (50 
fih) were cooled on an icebath and added per 10 mg of dry resin. 
The reaction was allowed to proceed for ihour at room 
temperature and the resin was drained and washed with neat TFA 

20 (lxl min, 1 ml). A solution of 8:1:1 TFA/TFMSA/ m- cresol (100 
fih) (cooled on an icebath) was added per 10 mg of dry resin. 
The reaction was allowed to proceed for 2 hours and the resin 
was drained and washed with TFA (lxl min, 1 ml) . The two TFA 
solutions combined and the aeg-PNA was precipitated by addition 

25 of dry ether. The precipitate was washed four times with dry 
ether. Yield: 12.7 mg (Purity>90%, purified by RP-HPLC, 
^Bondapak C18) . MS ( FAB+ ) m/z: : (found/calcd) 4249/4247 

EXAMPLE 55a 

egl linked Bis aeg-PNA Preparation 
3 0 H-TTT-TCC-TCT-C-egl-egl-egl-CTC-TCC-TTT-T-Lys-NH 2 (SEQ ID 
NO: 39) (egl=-NH-CH 2 -CH 2 -0-CH 2 -CH 2 -0-CH 2 -C(=0) -) 

The title egl linked bis aeg-PNA was synthesized 
according to the procedures of Example 55. 
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EXAMPLE 55b 

gl linked Bis aeg-PNA Preparation 
H-GTA-GAT-CA-egl- gl- gl-TGA-TCT-AC-Lys-NH 2 (SEQ ID NO:40) 
(egl»-NH-CH 2 -CH 2 -0-CT a -CH 2 -0-CH a -C(«0) -) 

5 The title egl linked bis aeg-PNA was synthesized 

according to the procedures of Example 55. 

EXAMPLE 56 

egl linked Bis aeg-PNA Having Peeudoisocytosine (J) 
H-TJT- JTT-T-egl-egl-egl-TTT-CTC-T-Lys-HH 2 (SEQ ID NO:41) 
10 (egl«-NH-CH,-CH a -0-CH 2 -CH 3 -O.CH 2 -C(aO) -) 

The protected aeg-PNA was assembled on a Boc-Lys (C1Z) 
modified MBHA resin with a substitution of approximately 0*10 
mmol/g. The synthesis was initiated on 100 mg (dry weight) of 
t-Boc-Lys (C1Z) -MBHA resin, preswollen overnight in dichloro- 

15 methane. The bis aeg-PNA was synthesized as per the procedures 
of Example 55. In step (4) the aeg-pseudoisocytosine monomer 
of Examples 26 thru 32 (25.1 mg, 0.5 mmol) was used for the 
incorporation of the aeg-J unit. The bis aeg-PNA was cleaved 
from the resin as per the procedures of Example 45. Yield: 5.5 

20 mg (purity> 90%, purified by RP-HPLC, /iBondapak C18) . MS (FAB+) 
m/z: : (found/calcd) 4748/4747. 

Using the procedures of this Example the following 
additional egl linked Bis aeg-PNAs Having Pseudoisocytosine 
(J! , were synthesized: 

25 H - TT J - T J J - TT - egl - egl - eg 1 * TTC - CTC - TT - Ly s - NH 2 (SEQ ID 

NO:42) ; 

H-TTJ- JJT-TT-egl-egl -egl -TTT- JJJ-TT-Lys-NH 2 (SEQ ID 

NO: 43) ; 

H-TTJ-TJJ-TTT-egl-egl-egl-TTT-CCT-CTT-NH 2 (SEQ ID 

30 N0:44); 

H-TTT- JJT-T-egl -egl-egl-TTC-CTT-T-NH 2 (SEQ ID NO: 45) ; 
H-TTT-TJJ-TJT-J-egl-egl-egl-CTC-TCC-TTT-T-Lys-NH 2 (SEQ 

ID NO: 46) ; 

H-TTT-TJJ-TJT-JJJ-TJT-egl-egl -egl -TCT-CCC-TCT-CCT-TTT- 
35 Lys-NH 2 (SEQ ID NO:47) / 
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H-TTJ-TTJ-TTT-T-egl-egl-egl-TTT-TCT-TCT-T-Lys-NH 2 (SEQ 
ID NO:48) ; 

H-CTT-TTT-TCT-T-egl-egl-egl-TTJ-TTT-TTT- J-Lys-NH 2 (SEQ 
ID NO:49) ; 

5 H-CTC-TTC-TTT-C-egl-egl-egl-JTT-TJT-TJT-J-Lys-NH 2 (SEQ 

ID NO:50) ; 

EXAMPLE 56a 

PNA Having Pseudoisocytosine (J) 
H-T 3 JTJT-Lys-NH 2 (SEQ ID NO: 51) 

10 The title compound was synthesized according to the 

procedures in Example 56. No linker was incorporated for this 
aeg-PNA having the pseudoisocytosine (J) base. 

EXAMPLE 57 

egl linked Bis aeg-PNA Having Pseudoisocytosine (J) 
15 H - TCT - CTT - T - egl - egl - egl - TTT - JT J - T - Ly s - NH 2 (SEQ ID N0:52) 
(egl«-NH-CH 2 -CH a -0-CH 2 -CH 2 -0-CH 2 -C(«=0) -) 

The protected aeg-PNA was assembled on a Boc-Lys(ClZ) 
modified MBHA resin with a substitution of approximately 0.10 
mrr.ol/g. The synthesis was initiated on 100 mg (dry weight) of 

20 t-Boc-Lys (ClZ) -MBHA resin, preswollen overnight in 
die hi orome thane . The bis aeg-PNA was synthesized as per the 
procedures of Example 55. In step (4) aeg-pseudoisocytosine 
monomer of Examples 26 thru 32 {25.1 mg, 0.5 mmol) was used for 
the incorporation of the aeg-pseudoisocytosine unit. The bis 

25 aeg-PNA was cleaved from the resin as per the procedures of 
Example 45. Yield: 2 . 8 mg (purity >90%, purified by RP-HPLC, 
M^cndapak C18) . MS (FAB+) m/z: : (found/calcd) 4749/4747. 

EXAMPLE 58 

Synthesis of aeg-PNA 
30 H-TCT-CTT-T-Lys-NH 2 (SEQ ID NO:53) 

The title aeg-PNA was synthesized as per the 
procedures of Example 45. 



WO 96/02558 



PCT/US95/09084 



- 85 - 

EXAMPLE 59 

aeg-PNA Oligomer Having Pseudoisocytosin (J), and 
Pseudouridin <¥U) , H - OWA - GA*U - JA J - *U - Lye - NH 2 (SEQ ID NO: 54) 

The protected aeg-PNA oligomer was assembled on a Boc- 
5 Lys(ClZ) modified MBHA resin with a substitution of 
approximately 0.10 mmol/g. The synthesis was initiated on 100 
mg (dry weight) of t-Boc-Lys (ClZ) -MBHA resin, preswollen 
overnight in dichloromethane. The aeg-PNA oligomer was 
synthesized as per the procedures of Example 55. In step (4) 

10 aeg-pseudoisocytosine monomer of Examples 26 thru 32 (25.1 mg, 
0.5 mmol) and the aeg-pseudouracil monomer of Examples 33 thru 
36 (19.2 mg, 0.5 mmol) were used for the incorporation of the 
aeg-pseudoisocytosine and aeg-pseudouracil monomeric units. 
The aeg-PNA was cleaved from the resin as per the procedures 

15 of Example 45. Yield: 5.8 mg (purity >90%, purified by RP- 
HPLC, /xBondapak C18) . MS (FAB+) m/z ; : (found/calcd) 2811/2811 

EXAMPLE 60 

aeg-PNA Oligomer Having aeg-Ieocytosine (aeg-iC) , and aeg- 
Ps eudoi socy tos ine ( a eg- J) 
20 H-TiCC-iCTC-JCT-J-Lys-NH 2 (SEQ ID NO: 55) 

The protected aeg-PNA was assembled on a Boc-Lys (ClZ) 
modified MBHA resin with a substitution of approximately 0.10 
mmol/g. The synthesis was initiated on 100 mg (dry weight) of 
t-Boc-Lys (ClZ) -MBHA resin, preswollen overnight in dichloro- 

25 methane. The aeg-PNA -oligomer was synthesized as per the 
procedures of Example 45. In step (4) aeg-pseudoisocytosine 
monomer from Examples 26 thru 32 (25.1 mg, 0.5 mmol) and the 
aeg-isocytosine monomer from Examples 37 thru 40 (25.1 mg, 0.5 
mmol) were used for the incorporation of the aeg-pseudoisocyto- 

30 sine and the aeg-isocytosine monomeric units. The aeg-PNA was 
cleaved from the resin as per the procedures of Example 45. 
Yield: 5.8 mg (purity >90%, purified by RP-HPLC, fiBondapak 
C18) . MS (FAB+) m/z: : (found/calcd) 2702/2701 
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EXAMPLE 61 

1-Carboxym thyl Pyrimidine-2-on 

The title compound was prepared using pyrimidine-2-one 
following the procedures of Example 23 . 

5 EXAMPLE 62 

N- ( [l-Pyrimidine-2-one] -acetyl) -N- (2-Boc-aminoethyl) -glycine 
[BocPy monomer] 

1-Carboxymethyl pyrimidine*-2-one (Example 61, 1 eq) 
and Boc-aminoethylglycine ethyl ester (2.9 g) was dissolved in 

10 DMF {50 Ml). Dhbt-OH (l.l eq) was added and the mixture was 
cooled in an ice bath. DCC (1.2 eq, 2.9g) was added and the 
mixture was stirred over night at room temperature. The DCU 
was filtered off and washed with DMF. The DMF layers were 
combined and evaporated in cacuo. The resulting residue was 

15 dissolved in dichlorome thane and washed with saturated aqueous 
NaHC0 3/ KHS0 4 , and NaCl . The organic phase was dried over MgS0 4 
and evaporated in vacuo. The residue was purified by silica 
gel column chromatography to give the title compound. 

EXAMPLE 63 

20 a eg /Aha linked Bis aeg-PNA preparation 

H-T3PyT2- (aha)aeg(aha)T3AT3-Lys-NH 2 (SEQ ID NO:65) 

The title Bis-PNA oligomer was assembled in an 
iterative process using N- ( [l-pyrimidine-2-one] -acetyl) -N- (2- 
Boc-aminoethyl) -glycine [BocPy monomer] of Example 62 following 
25 the procedures of Example 47. 

EXAMPLE 64 

N-acetyl-N- (2 -Boc- amino ethyl) -glycine [BocAc monomer] 

The title compound was prepared using acetic acid 
following the procedures of Example 25. 
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EXAMPLE 65 

<Aha)aeg(Ac)Aha linked Bis a g-PNA preparation 
(Aha) -T10- (Aha) a g ( Ac ) Aha - Tl 0 - NH a (SEQ ID NO:66) 

The title Bis-PNA oligomer was assembled in an 
5 iterative process using N-acetyl-N- (2-Boc-aminoethyl) -glycine 
[BocAc monomer] of Example 64 following the procedures of 
Example 47 . 

EXAMPLE 66 

N- (4-pentynoyl) -N- (2-Boc-aminoethyl) -0- alanine [BocW monomer] 

10 The title compound was prepared using 4-pentynoic acid 

following the procedures of Example 25. 

EXAMPLE 67 

Bis aeg-PNA preparation Containing W 

H- T JTWT JWTTT - egl - egl - egl - TIT6CTGTCT - NH a (SEQ ID NO: 67) 

15 The title W containing Bis-PNA oligomer was assembled 

in an iterative process using N- (4-pentynoyl) -N- (2-Boc- 
aminoethyl) -0- alanine [BocW monomer] of Example 66 following 
the procedures of Example 47. 

EXAMPLE 68 

20 Bis aeg-PNA preparation Containing W 

H - T JTWT JWTTT - egl - egl - egl - TITACTATCT -NHj (SEQ ID NO: 68) 

The title W containing Bis-PNA oligomer was assembled 
in an iterative process using N- (4-pentynoyl) -N- (2-Boc- 
aminoethyl) -0- alanine [BocW monomer] of Example 67 following 
25 the procedures of Example 47, 

EXAMPLE 69 

N-Boc-4-aminobutyxic acid 

The title compound is prepared using 4 -amino -butyric 
acid following the procedure of Example 2 . 
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EXAMPLE 70 

N- (4-aminobutyryl) -N- (2-Boc-aminoethyl) -glycine [Bocaba 
monomer] 

The title compound was prepared using N-Boc-4- 
5 aminobutyric acid of Example 69 following the procedures of 
Example 25. 

EXAMPLE 71 

aba-egl-His- (jS-ala) -Asp linked Bis aeg-PNA preparation 
(E 3 N-GT6TGC aba-egl-His- (jS-ala) - Asp * GCACACG - Lye - NH a 
10 (SEQ ID NO:69) 

The title compound was prepared using the N-(4- 
aminobutyryl) -N- (2-Boc-aminoethyl) -glycine of Example 70 
following the procedures of Example 47. 

EXAMPLE 72 

15 N-Boc-N' - (2-Z-aminoethyl) ethylendiamine hydrochloride. 

Diethylenetriamine (45.16 ml; 0.418 mol) was dissolved 
in chloroform (400 ml) and a solution of tert-butyl-4- 
nitrophenyl carbonate (10.00g; 0.0418 mol) in chloroform (100 
ml) was added dropwise over a period of 3 hours at 0°C. 

20 Subsequently the reaction mixture was left overnight at room 
temperature . Precipitated nitrophenol was removed by 
filtration and washed twice with chloroform. The yellow oil 
obtained by evaporation of the filtrate to dryness, in vacuo, 
was dissolved in water (200 ml) and pH was adjusted to 3.5 with 

25 4 M hydrochloric acid at 0°C. The water phase was extracted 
with ethyl acetate (3 x 300 ml) followed by adjustment of pH 
tc 7 with 2 M sodium hydroxide and reduction of the volume to 
approximately 300 ml, in vacuo. The pH was then adjusted to 
12 with 2 M sodium hydroxide and the solution was extracted 

30 with ethyl acetate (8 x 300 ml) . The organic phase was washed 
with saturated aqueous sodium chloride (2 x 500 ml) , dried over 
magnesium sulphate, and evaporated to dryness, in vacuo. The 
resulting oil was dissolved in water {50 ml.), pH was adjusted 
5 at 0°C with 4 M hydrochloric acid, and the solution was 

35 evaporated to dryness under reduced pressure. The residue was 
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washed several times with ether and dried over sicapent, in 
vacuo. A portion of the dry compound (3.80 g; 0.138 mol) was 
dissolved in a mixture of water and dioxane (70 ml; 1 volume 
water to 1 volume dioxane) and pH was adjusted to 11 with 2 M 
5 sodium hydroxide. Subsequently a solution of benzyls- 
nit rophenyl carbonate (4.14 g; 0.015 mol) in dioxane (35 ml) 
was added dropwise over a period of 2 hours while pH was 
currently maintained at 11 by addition of 2 M sodium hydroxide. 
After the mixture had been stirred at room temperature 

10 overnight, pH was adjusted to 3.5 with 4 M hydrochloric acid 
at 0°C and the solution was shaken with ethyl acetate (4 x 100 
ml) . Between each shaking the precipitated product was 
filtered off, washed twice- with ethyl acetate and dried over 
sicapent, in vacuo. Yield 3.42 g (66%). Mp. 194-196°C. Anal. 

15 for C 17 H 2e N 3 0 44 Cl , found (calc) C: 53.77 (54.61) H: 7.46 (7.55) 
N: 11.07 (11.24) Cl : 9.63 (9.48). 'H-NMR (400 MH Z ; DMSO-d 6 ) ; 
01.47 (s. 9H, t-Bu) ; 3.02-3.09 (unresolved m. 4H, 2 x CH 2 ) ; 
3.28-3,39 (unresolved m. 4H.2 x CH 2 ) ; 5.12 (s, 2 H, C 6 H S -CH 2 -) ; 
7.10 (b, 1H, BocNH-); 7.44 (m. 5H, C 6 H 5 -CH 2 -); 7.55 (b. 1 H, 

20 ZNH-); 8.97 (b-CH 2 NH 2 ClCH 2 -) . MS-FABm/z338 (M +1) ; 282 (M-t-Bu+ 
1) . 

EXAMPLE 73 

N- (2 -2-Aminoethyl) -N- (2-Boc-aminoethyl) glycine ethyl ester. 

N-Boc-N' - (2-Z-aminoethyl) ethylendiaminehydrochloride 
25 (2.0 g; 0.0054 mol) was dissolved in a mixture of DMF (4 0 ml) 
and triethylamine (1.85 ml; 0.013 mol) and ethyl bromoacetate 
(0,72 ml; 0.0064 mol) was added. The reaction mixture was 
stirred overnight at room temperature. Subsequently water (40 
mol) was added and the solution was extracted with methylene 
30 chloride (2 x 50 ml) . The organic phase was extracted with 
saturated aqueous sodium chloride and dried over magnesium 
sulphate. Evaporation to dryness, in vacuo , resulted in a 
yellow oil which was purified by column chromatography on 
silica gel by eluting with 10% methanol in methylene chloride. 
35 This afforded the title compound as an oil. Yield 1.98 g 
(87%). Anal, for C 21 H 33 N 3 0 6 . found (calc) C: 58.38) (59.56) H: 
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8.01 (7.85) N: 10.06 (9.92). l H-NMR(400 MH 2 ; CDC1 3 :) 61.26 (t, 
3H, -CH 2 CH3) ; 1.41(s, 9H, t-Bu) ; 2.81(m, 4H, -CH^&NCHjCH^ ) ; 
3.19 (m, 2H, BocNH-Qk-) ; 3.28 (m, 2H, ZNH-CHj-); 3.43 (s, 2H, - 
CH 2 COOEt); 4.16 (q, 2H, -CH 2 C& 3 ) ; 5.11(s, 2H, C^-CH^) ; 5.21(b ( 
5 1H, BocNH- ) ; 5.72(b, 1H, ZNH-); 7.35(m, 5H, C € H 5 - CH 2 - ) , MSG- FAB 
m/z 424 (M+l) . 



EXAMPLE 74 

N- (2-Z-Aminoethyl) -N- (2 -aminoethyl) glycine ethyl ester 
hydrochloride . 

10 N- (2-Z-Aminoethyl) -N- (2-Boc-aminoethyl) glycine ethyl 

ester (3.2 g; 0.00756 mol) was dissolved in 1M hydrochloride 
in acetic acid (32 ml) and stirred at room temperature. After 
4 hours the solution was evaporated to dryness, in vacuo. 
Yield 2.33 g (83%). Mp. 152°C (decomp.). Anal, for 

15 C 16 H 27 N 3 0 4 C1 2 , 2H 2 0 found (calc.) C:47:00 (47.41) H: 6.74 (7.71) 
N:10.37 (10.37) Cl-.17.33 (17.49). 'H-NMR (400 MHz; DMSO-d 6 ) ; 
fll.29 (t, 3H, ~CH 2 CHj) ; 3.24 and 3.43 <m's, 8H, 2 x -CH 2 CH_ 2 -); 
4.15 (s, 2H, -CH 2 C00Et) ; 4,23<q, 2H, -CH 2 CH 3 ) ; 5.11(s f 2H, C 6 H 5 - 
CH 2 -); 743 <m, 5H, C 6 H 5 -CH 2 -); 7,58 (b, 1H, ZNH-); 8.47 (b, 3H, - 

20 CH 2 NH 3 C1) . MS-FAB m/z 324 (M+l). 

EXAMPLE 75 

N- (/3-Methoxy-a-methylacryloyl) -N' - IN" - (2 -Z- aminoethyl) -N' ' - 
(ethoxycarbony line thy 1) -2 -aminoethyl] urea. 

/?-Methoxy-cx-methylacryloylchloride (0 .729g; 0 . 00542 
25 • mol) and silvercyanate (1.055 g; 0.00704 mol) were refluxed in 
dry toluene for 30 min. The suspension was subsequently cooled 
to room temperature, followed by addition of N- (2-Z- 
aminoethyl) -N- (2 -aminoethyl) glycine ethyl ester hydrochloride 
(2.00 g; 0.00542 mol) and diisopropylethylamine (2.36 ml; 0.014 
30 mol) . The mixture was stirred for 2 hours, then it was 
filtered to remove the salts and the salts were washed with 
methylene chloride (15 ml) . The filtrate was extracted with 
water (3 x 20 ml) and saturated aqueous sodium chloride (2 0 ml) 
and the organic phase was dried over magnesium sulphate and 
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evaporated to dryness, in vacuo. The resulting solid was 
washed thoroughly with ether and dried over sicapent, in vacuo. 
Yield 1.16g (46%), Mp. 73-76°C Anal, for C 22 H 32 N 4 0 7 found 

(calc.) C: 56.68 (56.88) H: 6.89 (6.94) N: 12.16 (12.06). 1 H- 
5 NMR (400 MHz, CDCl 3 ) ; 01.25 (t, 3H, -O^CHa) ; 1-68 (s, 3H, 
CH 3 OCHCHCH 3 ) ; 2.85, 3,27, and 3.38 (m's, 10 H, 2 x -HNCHjC&N- 
and -NqH 2 COO) / 3.67 (s, 3H, -OCH 3 ) ; 4.16 (q, 2H, -Q^CF^) ; 5,10 

(s, 2H f CgHs-CSs-); 6.12 (b, 1H, -CH 2 NHCONH-) ; 7.34 <m, 5H, CJk- 
CH 2 -); 7.41 (b, 1H, ZNH-); 8.94 (b, 1H, -CONHCO-). MS-FAB m/z 
10 465 (M + 1) . 

EXAMPLE 76 

N- (2-Boc-aminoethyl) -N- [2- (l-thyminyl) ethyl] glycine hydro 
trif luoroacetate . 

N- (/3-Methoxy-a-methylacryloyl ) -N' - [N' ' - ( 2 - Z- 
15 aminoethyl) -N' ' - (ethoxycarbonylmethyl) -2 -amino -ethyl] urea 
(1.146 g; 0.00247 mol) was boiled in 4 M hydrochloric acid for 
4 hours. Subsequently the solution was evaporated to dryness, 
in vacuo, and the resulting solid was suspended in 10% 
triethylamine in methanol (25 ml). Di- tert-butyl dicarbonate 
20 (1.24 g: 0.0057 mol) was added and the solution was stirred at 
room temperature for 1 hour followed by evaporation to dryness, 
in vacuo. The residue was purified by MPLC using an RP-8 
column and eluting with 14.4% acetonitrile in water containing 
0.1% trifluoroacetic acid. Yield 0.446 g (37%). Mp. 79°C 
25 (decomp.). Anal, for C 18 H 27 N 4 0 8 F 3 found (calc.) C: 43.68 (44.63) 
H: 5.30 (5.62) N: 11.64 (11.57). "H-NMR (400 MHz; D 2 0) ; 01.31 
• (s, 9H, t-Bu) ; 1.80 (s, 3H, T-CH 3 ) ; 3.37-4.17 (m's, 10H, 5 X 
CH 2 ) ; 7.42 (s, 1H, CH in T) . MS-FAB m/z 371 (M + 1); 271 (M - 
Boc+1) . 

3 0 EXAMPLE 77 

Thermal stability of Bis PNA 

The title aeg-PNA from Example 58 and the title bis 
aeg-PNA from Examples 55, 56 and 57 were used in a study to 
determine the thermal stability of (PNA) 2 /DNA triplex formation 
35 relative to oligonucleotide targets at pH 5, 7 and 9. The 
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deoxyoligonucleotides used as the targets were: I-CGC-AGA-GA3C- 
GC; and II-CGC-A3GA-GAC-GC. These target molecules are 
antiparallel . 

The study was carried out in lOOmM NaCl, 10 mNI Na- 
5 phosphate, 0.1 mM EDTA, The heating rate was 0.5 "C/min at 5- 
90 °C. 

The PNA and each bis PNA was independently bound to 
each of the targets and the Tm was determined at each of the 
pK ranges. The results are tabulated below. 



10 Sequence 

CciriDOund A 

H-TCT-CTT-T-egl-egl-egl- 
TTT - CTC-T- Lys -NH 2 
(SEQ ID NO:38) 



pH Target I Target II 



5 


69 


.0 


°C 


68 


.5 


°C 


7 


49 


.0 


°C 


■52 


.0 


°C 


9 


38 


.5 


°C 


41 


.0 


°C 



15 Compound B 

H-TJT- JTT-T-egl-egl-egl- 
T7T -CTC-T- Lys -NH 2 
(SEQ ID NO: 41) 

Compound C 
2 0 H-TCT-CTT-T-egl-egl-egl- 

T7T- JTJ-T-Lys-NH 2 
(SEQ ID NO: 52) 

Compound D 
■ H-TCT-CTT-T-Lys-NH 2 

25 (SEQ ID NO:53) 



5 


67.0 


°C 


65.0 


°C 


7 


64.0 


°C 


48.5 


°C 


9 


60.5 


°C 


39.0 


°C 


5 


66.0 


°C 


61.5 


°C 


7 


47.0 


°C 


60.0 


°C 


9 


37.5 


°C 


59.0 


°C 


5 


50.0 


°C 


55.5 


°C 


7 


40.0 


°C 


39.0 


°C 


9 






23.5 


°C 



Ccr.pound E 
H - 7 5 JT JT - Ly s NH 2 
(SEQ ID NO: 51) 
plus Compound D 



5 


46 


.0 


°C 


48 


.5 


°C 


7 


36 


.0 


°C 


44 


.0 


°C 


9 


37 


.0 


°C 


42 


.5 


°C 



30 The results of the study clearly show that a small but 

sianificant increase in Tm is obtained by linking the two PNA 
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together. This is shown by comparing the results obtained for 
Compound A with the results obtained for Compound D. The study 
also shows that no major difference is observed when comparing 
DNA targets of opposite polarity. 
5 The Tms of the compounds studied show a strong pH 

dependence for compounds that do not have pseudoisocytosine in 
the parallel hoogsteen strand. This pH dependence is accounted 
for by the necessary protonation of the cytosine in the 
Hoogsteen strand. This protonation is not necessary with the 

10 pseudoisocytosine for binding to occur. 

In compound B the cytosines in one of the linked 
strands of the compound were replaced by pseudoisocytosines and 
in Compound C the cytosines in the other strand of the linked 
strands were similarly substituted, to study the effect of pH 

15 on the thermal stability of the triplexes formed. These PNA 
showed thermal stability at acidic pH (5) comparable to that 
of bis PNA compound A. However in the complexes where the 
cytosine containing portion of the compound is ant i -parallel 
to the DNA target (and thus the pseudoisocytosine strand is 

20 parallel) almost no pH dependence of the Tm is observed. This 
was observed in Compound B with target I and Compound C with 
target II. These results indicate that the orientation directs 
the complex formation (anti-parallel-* Watson/Crick) . The pH 
dependence shown for Compound B with target II and Compound C 

25 with target I shows that the cytosine strands o'f these 
compounds are involved in Hoogsteen hydrogen binding. 
Compounds A thru C showed a very fast rate of formation upon 
■ cooling. This lack of a pronounced hysterisis in the melting 
behavior that is normally observed with two single strands of 

3 0 PNA binding to DNA is ascribed to the high local concentration 
of the now covalently linked second PNA strand. 

EXAMPLE 78 

Effect of Base Pair Mismatches on bi s - PNA/DNA , aeg-PNA 2 /DNA 
Thermal Stabilities (Tm) 

3 5 Three of the aeg-PNA studied in Example 61, Compound 

C H-TCT-CTT-T-egl -egl-egl-TTT- JTJ-T-Lys-NH 2 (SEQ ID NO: 52) 
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(PNA-C), Compound D H-TCT-CTT-T-Lys-NH 2 (SEQ ID NO: 53) (PNA-D) 
and Compound E H - T 3 JT JT - LysNH 2 (SEQ ID NO: 51) (PNA-E) were 
studied to determine the effect of binding to an oligonucleo- 
tide target containing a mismatch. 

5 Oligonucleotide PNA-C PNA-D + PNA-E 

5' -dCGC-A 3 -GAG-ACG-C-3' 60.0 °C 44.0 °C 

(SEQ ID NO: 56) 

5' -dCGC-A 3 -CAG-ACG-C-3' 27.0 °C 32.5 °C 

(SEQ ID NO: 57) 

10 5' -dCGC-A 3 -AGA-GAC-GC-3' 36.5 °C 34,0 °C 

(SEQ ID NO: 58) 

5' -dCGC-A 3 -TAG-ACG-C-3 23.0 °C 33.0 °C 

(SEQ ID NO: 59) 

5' -dCGC-A 3 -CAC-ACG-C-3' sll . 0 °C sll.O °C 

15 (SEQ ID NO:60) 

The sequence discrimination of the bis PNA (PNA-C) as 
judged from thermal stability measurements suffers a very high 
cost in stability (30 °C for a base mismatch) , reflecting the 
two- fold recognition process involving both PNA strands. 

2G EXAMPLE 79 

Strand Displacement Binding of Bis aeg-PNA' s 

A 32 P-end labeled EcoRl-PvuII fragment of the plasmid 
pTHa 12 was incubated with aeg-PNA in 100 (il 10 mM Na-phosphate, 
ImM EDTA, pH 7 for 6 0 minutes at 2 0 °C / and subsequently 

25 "treated with KMn0 4 (20mM for 15 sec) . Following precipitation 
and treatment with piperidine the samples were analyzed by 
electrophoresis in polyacrylamide sequencing gels and the 
radioactive DNA bands visualized by autoradiography. The 
following concentrations of PNA were used: 1 jxM t 3 /zM, 10 /xM, 

3 0 and 30 /iM, and the PNAs were compounds A, B, C and D from 
Example 77. A control was also run which contained no PNA. 

The results show the fragments expected for strand 
displacement. 
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EXAMPLE 80 

Binding affinity of Bis a g-PNA 

In order to study the binding properties of bis aeg- 
PNA's as compared to that of the unlinked aeg-PNA's the 
5 following aeg-PNA's and bis aeg-PNA' S were synthesized: 

Compound A H 2 N-TTCTCTCTCT-CONH 2 (SEQ ID NO;61) 
Compound B H 2 N-gly-TCTCTCTCTT-lys-CONH 2 {SEQ ID NO: 62) 
Compound C H 2 N - gly- TTCTCTCTCT - lys-Aha-lys-Aha-lys- 

TCTCTCTCTT-lys-CONH 2 (SEQ ID NO: 27) 
10 Compound D H 2 N- gly- TTCTCTCTCT - egl - egl - egl - TCTCTCTCTT - 1 ys - 

CONH 2 (SEQ ID NO: 63) 

Two standard 10 mer aeg-PNA's were synthesized 
opposite in orientation e.g. antiparallel (compounds A and B) , 
as per the procedures of Example 45 , and two bis aeg-PNAs were 

15 synthesized with two 10 mer sequences linked together via 
linking moieties. One of the bis aeg-PNAs (Compound C) was 
linked using Aha and lys groups previously described in Example 
47. The other bis aeg-PNA (Compound D) was linked using poly 
ethylene glycol linking moieties described in Example 55, The 

20 bis aeg-PNAs are identical except for the linking moieties. 

Dissociation constants {Y^s) for duplex DNA strand 
invasion were determined for each bis aeg-PNA, each single aeg- 
PNA and an equimolar mixture of the single aeg-PNAs. 
Hybridization was for 4 days at 37°C in 100 mM Na + (IX TMTB) . 

25 The DNA targets were 65 mer duplexes containing the 
* complementary sequence in opposite orientations. 

The Aha linked bis aeg-PNA (Compound C) , bound duplex 
DNA about 500 times better than the best single aeg-PNA 
(Compound B) . The PEG linked bis aeg-PNA bound as well as the 

3 0 best single aeg-PNA (Compound B) . The observed orientation of 
bis binding was with the Aha linker crossing the 5' end of the 
triplex. Bis binding to single stranded RNA and DNA targets 
was also evaluated and compared to individual aeg-PNAs. The 
Aha linked bis aeg-PNA bound ssDNA more than 100 times better 

3 5 than aeg-PNA. The preferred orientation has the linker 
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crossing the 5' end of the target strand. The individual aeg- 
PNAs bound ssRNA more than 500 times tighter than ssDNA. The 
bis aeg-PNA bound ssRNA 3 times better than the best binding 
single aeg-PNA, 

5 Bis aeg-PNA strand invasion was evaluated in the 

presence of Mg ++ and spermine. In previous experiments, Mg + * 
was shown to weaken PNA strand invasion while spermine 
completely inhibited binding. Mg++ and spermine resulted in 
weaker binding of Compound C, however detectable strand 

10 invasion was observed in the presence of spermine. 

The hybridization rate for Compound C invasion of 
duplex DNA was determined at two concentrations and compared 
tc invasion rates for several single PNAs . Compound C bound 
9 times faster than single PNA. The improved strand invasion 

15 by bis PNA is associated with a faster on rate. This may be 
due to the close proximity of the second, triple stranding aeg- 
PNA to stabilize the strand invaded aeg-PNA in the duplex. The 
second aeg-PNA may prevent the invaded aeg-PNA from being 
ejected from the duplex. 

2 0 To ensure that the above improved binding with bis 

aeg-PNA (Compound C) was not due to non-specific binding of the 
Aha linker. Compound C was hybridized with up to 1 /xM 
noncomplementary duplex target. No binding was observed. 

EXAMPLE 81 

2 5 ES/MS of Bis-aeg-PNA: DNA 

Compound A ATT GTA GAG AGA GAA T (SEQ ID NO: 64) 

The binding stoichiometry of a bis aeg-PNA and a 
single stranded DNA were determined by mass spectrometry using 
a Hewlett-Packard 59987A electrospray unit, 5989A quadrapole 

3 0 mass spectrometer with extended mass range, and a Hewlett 

Packard 1090 HPLC connected to the electrospray needle via an 
LC packings 1/100 splitter. Compound A a DNA single strand 
16mer with mass 4991 AMU (8 (iM in a 50 mM NH 4 0AC solution) and 
Compound C from Example 80, a bis aeg-PNA of mass 6012 (10 fxM 
3 5 in a 50 mM NH<0AC solution) were analyzed separately and as a 
mixture. The sample to be tested as a mixture was taken from 
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the 50 mM stock solutions in NH 4 Oac, incubated to 37 °C for 72 
hours, and cooled to 2 °C for 48hours. The samples are warmed 
to room temperature prior to testing. 50 pi of the stock each 
sample and the mixture was mixed with 75 fil of isopropanol and 
5 injected into a 50 fil loop which continuously feeds the mass 
spectrometer. The samples were all analyzed in negative ion 
mode and a minimum of 16 scans were averaged to determine the 
masses. The deconvolution of data was performed by Hewlett- 
Packard's electrospray deconvolution program. The observed 

10 mass of the single stranded DNA was 4990 AMU and the observed 
mass of the bis aeg-PNA was 6012 AMU. The observed mass of the 
mixture was 11005 AMU which corresponds to one DNA strand and 
one bis aeg-PNA strand e.g. a 1:1 ratio. The calculated mass 
for the triplex is 11002 AMU, within 0.03% of the calculated 

15 mass of 11005 AMU* 



EXAMPLE 82 

ES/MS of aeg-PNA, /DNA 

The binding stoichiometry of two single aeg-PNAs and 
a single stranded DNA containing the complimentary sequence, 
20 were determined by mass spectrometry using the apparatus of 
Example 81. 

Compound A ATT GTA GAG AGA GAA T (SEQ ID NO: 64) 

8 /xM single strand DNA (Compound A) was taken from a 
50 mM NH 4 0ac stock solution and mixed with 20 /iM single aeg-PNA 
25 (Compound C, Example 80) also taken from a stock solution of 
5C mM NH 4 0ac. The two samples are mixed together and incubated 
■ to 37 °C for 72 hours, and cooled to 2 °C for 48 hours. The 
samples are warmed to room temperature prior to testing. The 
experimentally found mass of the compound thus formed in the 
30 mixture was 10597 Da. The mass of the single aeg-PNA is 2802 
and the mass of the single stranded DNA target (Compound A) is 
4S90 Da. The total of one single stranded DNA and two of the 
aeg-PNAs is 10594 Da. This mass is the result of two to one 
stoichiometry e.g. two aeg-PNAs to one DNA. 
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EXAMPLE 83 

Transcription Initiation with single PNA, Trans PNA and Cis PNA 

Restriction fragments of three plasmids pT9C, pT9CT9C 
(pUC19 derivatives containing respectively the sequences T9C 
5 and T9CT9C) and pT9CA9GKS 9bluescript KS+ derivative containing 
a T9CA9G sequence) were isolated by digestion with PvuII and 
purification on polyacrylamide gels resulting in fragments of 
338 base pairs (pT9C) , 354 base pairs (pT9CT9C) and 477 base 
pairs (pT9CA9GKS) • PNA- DNA complexes were formed by incubating 

10 PNA with DNA fragments in 10 mM Tris-HCl pH 8.0 and 0.1 mM EDTA 
in a total volume of 15 fih for 1 hour at 37 °C. The reaction 
mixture was adjusted to contain a final concentration of 40 mM 
Tris-HCl pH 7.9, 120 mM KCl, 5 mM MgCl 2 , Oil mM DTT, and ImM of 
ATP, CTP, GTP and 0.1 mM of OTP and 5 /xCi 32 P UTP. The PNA 

15 used was T9C-lysNH 2 in each case. 

The three plasmids used provide respectively a single 
binding site for the PNA (mono) , a pair of binding sites on the 
same DNA strand (cis) , and a pair of binding sites on opposite 
strands of the DNA (trans) . Complexes were formed between the 

20 PNA's and each of the three plasmids with the PNA concentration 
at 0 m, 3 nM, 10 nM, 3 /iM and 10 (M. 

The transcriptions were initiated by addition of 100 
nM E. Coli RNA polymerase holoenzyme (Boeringer) . The mixtures 
(total volume of 30 fil) were incubated at 37 °C for 20 minutes 

2 5 and the RNA produced by transcription was subsequently 

recovered by ethanol precipitation. The RNA transcripts were 
analyzed on 8% denaturing polyacrylamide gels, and visualized 
by autoradiography. 

As viewed on the corresponding gel for the mono at a 

3 0 PNA concentration of 10 fiM is the production of a single RNA 

product having the size expected if transcription occurs at the 

PNA binding site. 

As viewed on the corresponding gel for the cis at a 
PNA concentration of 10 is the production of a single RNA 
3 5 transcript but transcription is shown to be more efficiently 
promoted by the presence of two oligo PNA's at the binding site 
arranged in cis. 
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As viewed on the corresponding gel for the trans PNA 
at concentrations of lOnM, 3 and 10 fM is the production of 
two RNA transcripts of the expected sizes if transcription is 
initiated of each of the two DNA strands and proceeds from the 
5 respective binding site to the end of the DNA fragment. 

In the gels where transcript RNA is seen, it is 
estimated that from 1 to 5 RNA molecules are being produced per 
DNA template molecule. 
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WE CLAIM: 

1. A compound comprising: 

a peptide nucleic acid strand, said peptide nucleic 
acid strand including at least one peptide nucleic acid unit 
having a pyrimidine heterocyclic base; and 

said pyrimidine heterocyclic base comprising a C- 
pyrimidine heterocyclic base or an iso-pyrimidine heterocyclic 
base. 

2 . A compound of claim 1 wherein said pyrimidine 
base is a C-pyrimidine heterocyclic base. 

3 . A compound of claim 2 wherein said C-pyrimidine 
heterocyclic base is pseudo-isocytosine . 



heterocyclic base is pseudo-uracil . 

5 . A compound of claim 2 wherein said iso-pyrimidine 
heterocyclic base is 5-bromouracil . 

6 . A compound of claim 1 wherein said pyrimidine 
base is an iso-pyrimidine heterocyclic base, 

7. A compound of claim 6 wherein said iso-pyrimidine 
heterocyclic base is iso-cytosine . 

8. A compound of claim 1 wherein said peptide 
nucleic acid strand includes a compound of the formula: 



4 . 



A compound of claim 2 wherein said C-pyrimidine 



A 



L 




A 



L 



n 



n 




I 



wr.arein : 
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n is at least 2, 

each of L 2 -L n is independently selected from the group 
consisting of hydrogen, hydroxy, (C^-CJ alkanoyl, naturally 
occurring nucleobases, non-naturally occurring nucleobases, 
aromatic moieties, DNA intercalators, nucleobase -binding 
groups, heterocyclic moieties, and reporter ligands; 

each of C 1 -^ is (CR 6 R 7 ) y where R € is hydrogen and R 7 is 
selected from the group consisting of the side chains of 
naturally occurring alpha amino acids, or R 6 and R 7 are 
independently selected from the group consisting of hydrogen, 
(C ; -C 6 ) alkyl, aryl, aralkyl, heteroaryl, hydroxy, (C x -C 6 ) alkoxy, 
(C : -C 6 ) alkylthio, NR 3 R 4 and SR S , where R 3 and R 4 are each 
independently selected from the group consisting of hydrogen, 
(C : -C 4 ) alkyl , hydroxy- or alkoxy- or alkylthio-substituted (C a - 
C 4 ) alkyl, hydroxy, alkoxy, alkylthio and amino, and R 5 is 
hydrogen, (C^-C^) alkyl, hydroxy-, alkoxy-, or alkylthio- 
substituted (Cj-^) alkyl, or R 6 and R 7 taken together complete 
an alicyclic or heterocyclic system; 

each of D 2 -D n is (CR*R 7 ) 2 where R 6 and R 7 are as defined 

above ; 

each of y and z is zero or an integer from 1 to 10, 
the sum y + z being greater than 1 but not more than 10; 

each of G^G 11 " 1 is -NR 3 CO-, -NR 3 CS-, -NR 3 SO- or 
-NR 3 S0 2 -, in either orientation, where R 3 is as defined above; 

each of A x -A n and B*-B n are selected such that: 

(a) A is a group of formula (Ila) , (lib) , (lie) or 
(lid), and B is N or R 3 N\- or 

(b) A is a group of formula (lid) and B is CH; 



(Ila) 



(lib) 
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where : 

X is O, S # Se, NR\ CH 2 or C(CH 3 ) 2 ; 

Y is a single bond, 0, S or NR\- 
each of p and q is zero or an integer from 1 to 5, the sum p+q 
being not more than 10; 

each of r and s is zero or an integer from 1 to 5, the 
sum r+s being not more than 10; 

each R 1 and R 2 is independently selected from the 
group consisting of hydrogen, (C^-Cjalkyl which may be 
hydroxy- or alkoxy- or alkylthio- substituted, hydroxy, alkoxy, 
alkylthio, amino and halogen; and 

each R 3 and R 4 are as defined above; 

Q is -C0 2 H, -CONR'R'', -S0 3 H or -S0 2 NR'R' ' or an 
activated derivative of -C0 2 H or -S0 3 H; and 

I is -NHR' ' 'R' ' ' ' or -NR' ' ' C (0) R' ' ' ' , where R' , R" , 
R' ' ' and R' ' ' ' are independently selected from the group 
consisting of hydrogen, alkyl, amino protecting groups, 
reporter ligands, intercalators, chelators, peptides, proteins, 
carbohydrates, lipids, steroids, nucleosides, nucleotides, 
nucleotide diphosphates, nucleotide triphosphates, oligonucleo- 
tides, oligonucleosides and soluble and non-soluble polymers. 



' WO 96/02558 



PCT/US95/09084 



- 103 - 



9. A compound of claim 1 wherein said peptide 
nucleic acid strand includes a compound of the formula III, IV 
or V: 



R 1 



0 



CCH 2 D k 



0 



R 



7 ■ 



N 



CCH 2 D 



L 

I 



0. CCH 2 3, 




< CH 2 D k 



n 




R 



N. 



' NH- R 





wherein : 
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each L is independently selected from the group 
consisting of hydrogen, phenyl, heterocyclic moieties, 
naturally occurring nucleobases, and non-naturally occurring 
nucleobases ; 

each R 7 ' is independently selected from the group 
consisting of hydrogen and the side chains of naturally 
occurring alpha amino acids; 

n is an integer greater than 1, 
each k, 1, and m is, independently, zero or an integer from 1 
to 5; 

each p is zero or 1; 

R h is OH, NH 2 or -NHLysNH 2 ; and 

R 1 is H or COCH 3 . 

10. A compound comprising a first peptide nucleic 
acid segment and a second peptide nucleic acid segment, 
wherein: 

said segments are joined via at least one linking 
segment ; and 

said linking segment is not a peptide nucleic acid or 
an oligonucleotide. 

11. A compound of claim 10 wherein said linking 
segment is of the formula: 

- [HN-Z-C(=0)] n - 

wherein : 

n is 1 to 3; and 

Z is C x to C 20 alkyl, C 2 to C 20 alkenyl, C 2 to C 20 
alkynyl, C x to C 20 alkanoyl having at least one O or S atom, C, 
to C 3 , aralkyl, C 6 -C 14 aryl or an amino acid. 

12. A compound of claim 10 wherein said linking 
segment includes at least one unit of an aminoalkylcarboxylic 
acid of the formula 

-NH- (CH 2 ) e -C(=0) - 

where e is 1 to 15. 
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13 . A compound of claim 12 wherein e is 4 to 8 . 

14 . A compound of claim 13 wherein e is 5 or 6 . 

15. A compound of claim 12 wherein said linking 
segment further includes at least one amino acid. 



16. A compound of claim 10 wherein said linking 
segment comprises a compound of the formula 

- (AA) h - [NH- (CH a ) e -C{«0) - (AA) f ] g - 

where : 

AA is an or -amino acid; 
e is 4 to 8; 
f and h are 0 or 1; and 
g is 1 to 4 . 



17 . A compound of claim 10 wherein said linking 
segment includes at least one unit of a glycol amino acid. 

18. A compound of claim 17 wherein said glycol amino 
acid comprises glycol sub-units that are linked together in a 
linear array and that have an amino group on one terminus and 
a carboxyl group on the other terminus. 

19. A compound of claim 10 wherein said linking 
segment comprises a compound of the formula 

- [NH- (CH 2 -CH 2 -0-) r CH 2 -C(=0) -] i 

wherein : 

j is 1 to 6; and 
i is 1 to 6. 



20. A compound of claim 19 wherein j is 2 and i is 

3 . 

21. A compound of claim 10 wherein said peptide 
nucleic acid segments are joined together via two of said 
linking segments to form a cyclic structure. 
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22. A compound of claim 10 wherein said linking 
segment connects a terminal amine function on one of said first 
and second peptide nucleic acid segments to a carboxyl function 
on the other of said first and second peptide nucleic acid 
segments . 

23. A compound of claim 22 wherein said first peptide 
nucleic acid segment has a nucleobase sequence determined in 
a direction from its amine terminus to its carboxyl terminus, 
said second peptide nucleic acid segment has a nucleobase 
sequence determined in a direction from its carboxyl terminus 
to its amine terminus, and said sequences are the same. 

24. A compound of claim 10 wherein at least a portion 
of nucleobases of said first and second peptide nucleic acid 
segments are pyrimidine nucleobases. 

25. A compound of claim 24 wherein at least one of 
said pyrimidine nucleobases of one of said first or said second 
peptide nucleic acid segments comprises a C-pyrimidine 
heterocyclic base or an iso-pyrimidine heterocyclic base. 

26. A compound of claim 24 wherein said portion of 
said nucleobases that are pyrimidine nucleobases are located 
in contiguous homopyrimidine sequences. 

27. A compound of claim 10 wherein said linking 
segment comprises a carboxylic acid functional group and a 
primary amino functional group. 

28. A multiple stranded structure comprising: 

a nucleic acid strand, at least a portion of 
which forms a target nucleotide sequence; and 

a further strand, said further strand including 
first and second peptide nucleic acid segments that 
are joined together via a linker; 
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wherein: 

said first peptide nucleic acid segment has a 
nucleobase sequence that is complementary to the target 
nucleotide sequence in the 5' to 3' direction of said target 
nucleotide sequence; and 

said second peptide nucleic acid segment has a 
nucleobase sequence that is complementary to the target 
nucleotide sequence in the 3' to 5' direction of said target 
nucleotide sequence. 

29. The structure of claim 28 wherein said nucleic 
acid strand is a single stranded DNA or RNA. 

30. The structure of claim 28 wherein said nucleic 
acid strand is a double stranded DNA. 

31. The structure of claim 28 wherein one of said 
first or second peptide nucleic acid segments exhibits 
Watson/Crick binding to said target nucleotide sequence and the 
other of said first and second peptide nucleic acid segments 
exhibits Hoogsteen binding to said target nucleotide sequence. 

32. The structure of claim 31 wherein said one of 
said first or second peptide nucleic acid segments that 
exhibits Hoogsteen binding to said target nucleotide sequence 
includes C-pyrimidine heterocyclic nucleobases or iso- 
pyrimidine heterocyclic nucleobases in at least one of the 
positions that are complementary to nucleobases in said target 
nucleotide sequence. 

33. The structure of claim 32 wherein said C- 
pyrimidine heterocyclic nucleobase or iso-pyrimidine 
heterocyclic nucleobase is pseudo-isocytosine, iso-cytosine, 
pseudo-uracil or 5-bromouracil . 
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34. A compound comprising: 

a first segment of joined peptide nucleic acid units 
having a first nucleobase sequence; 

a second segment of joined peptide nucleic acid units 
having a second nucleobase sequence determined in a direction 
from its carboxyl terminus to its amine terminus; 

a linker group linking said first and said second 
segments of peptide nucleic acid units. 

35. A compound of claim 34 wherein: 

said first segment of peptide nucleic acid units 
exrends from an amino end to a carboxyl end; 

said second segment of peptide nucleic acid units 
ex .ends from an amino end to a carboxyl end; and 

said linker group links said carboxyl end of said 
first segment of peptide nucleic acid units to said amino end 
of said second segment of peptide nucleic acid units. 

36. A compound of claim 34 wherein: 

said first segment of peptide nucleic acid units 
ex -ends from an amino end to a carboxyl end; 

said second segment of peptide nucleic acid units 
ex -ends from an amino end to a carboxyl end; and 

said second nucleobase sequence, determined in a 
direction from the carboxyl end to the amino end of said second 
segment of peptide nucleic acid units, and said first 
nucleobase sequence, determined in a direction from the amino 
er.d to the carboxyl end of said first segment of peptide 
nucleic acid units, are the same. 

37. The compound of claim 36 wherein said linker 
group links said carboxyl end of said first segment of peptide 
nucleic acid units to said amino end of said second segment of 
peptide nucleic acid units. 
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